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FOREWORD 


(Home) ( About the Author) ( Teaching Units) ( Teaching Slides) 


In 35 years of teaching mathematics to Engineers and Scientists, I 
have frequently been made aware (by students) of a common 
cry for help. "We're coping, generally, with our courses", they may 
say, "but it's Just the Maths". This is the title chosen for the package 
herein. 


Traditional text-books and programmed learning texts can 
sometimes include a large amount of material which is not always 
needed for a particular course; and which can leave students 
feeling that there is too much to cope with. Many such texts are 
biased towards the mathematics required for specific engineering 
or scientific disciplines and emphasise the associated practical 
applications in their lists of tutorial examples. There can also be a 
higher degree of mathematical rigor than would be required by 
students who are not intending to follow a career in mathematics 
itself. 


"Just the Maths" is a collection of separate units, in chronological 
topic-order, intended to service foundation level and first year 
degree level courses in higher education, especially those 
delivered in a modular style. Each unit represents, on average, the 
work to be covered in a typical two-hour session consisting of a 
lecture and a tutorial. However, since each unit attempts to deal 
with self-contained and, where possible, independent topics, it 
may sometimes require either more than or less than two hours 
spent on it. 


"Just the Maths" does not have the format of a traditional 
text-book or a course of programmed learning; but it is written in a 
traditional pure-mathematics style with the minimum amount of 
formal rigor. By making use of the well-worn phrase, "it can be 
shown that", it is able to concentrate on the core mathematical 
techniques required by any scientist or engineer. The techniques 
are demonstrated by worked examples and reinforced by 
exercises that are few enough in number to allow completion, or 
near-completion, in a one-hour tutorial session. Answers to 
exercises are supplied at the end of each unit of work. 


A.J. Hobson 
January 2002 
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ABOUT THE AUTHOR 
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Lecturer in Mathematics of the School of Mathematical and 
Information Sciences at Coventry University. He graduated from 
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a form which may be easily accessible to students of Science and 
Engineerng in the future. 
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“JUST THE MATHS” 


UNIT NUMBER 
1.1 

ALGEBRA 1 
(Introduction to algebra) 

by 

A.J. Hobson 


1.1.1 The Language of Algebra 

1.1.2 The Laws of Algebra 

1.1.3 Priorities in Calculations 

1.1.4 Factors 

1.1.5 Exercises 

1.1.5 Answers to exercises 




UNIT 1.1 - ALGEBRA 1 - INTRODUCTION TO ALGEBRA 


DEFINITION 

An “Algebra” is any Mathematical system which uses the concepts of Equality (=), Addi- 
tion (+), Subtraction (-), Multiplication (x or .) and Division (A). 

Note: 

The Algebra of Numbers is what we normally call “Arithmetic” and, as far as this unit is 
concerned, it is only the algebra of numbers which we shall be concerned with. 

1.1.1 THE LANGUAGE OF ALGEBRA 

Suppose we use the symbols a, b and c to denote numbers of arithmetic; then 

(a) a + b is called the “sum of a and 6” . 

Note: 

a + a is usually abbreviated to 2 a, 
a + a + a is usually abbreviated to 3a and so on. 

(b) a — b is called the “difference between a and 6”. 

(c) a x b, a.b or even just ab is called the “product” of a and b. 

Notes: 

(i) 

a. a is usually abbreviated to a 2 , 

a. a. a is usually abbreviated to a 3 and so on. 

(ii) —1 x a is usually abbreviated to —a and is called the “negation” of a. 

(d) a -r b or | is called the “quotient” or “ratio” of a and b. 

(e) -, [also written a -1 ], is called the “reciprocal” of a. 

(f) | a | is called the “modulus”, “absolute value” or “numerical value” of a. It can 

be defined by the two statements 

| a |= a when a is positive or zero; 

| a |= —a when a is negative or zero. 

Note: 

Further work on fractions (ratios) will appear later, but we state here for reference the rules 
for combining fractions together: 
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Rules for combining fractions together 


1 . 


2 . 


3. 


4. 


a c ad+bc 

b + d = bd 

a c ad — be 

b d bd 

a c a.c 

b d b.d 


a ' c a d a.d 

b d b c b.c 


EXAMPLES 


1. How much more than the difference of 127 and 59 is the sum of 127 and 59 ? 

Solution 

The difference of 127 and 59 is 127—59 = 68 and the sum of 127 and 59 is 127+59 = 186. 
The sum exceeds the difference by 186 — 68 = 118. 

2. What is the reciprocal of the number which is 5 multiplied by the difference of 8 and 2 
? 

Solution 

We require the reciprocal of 5.(8 — 2); that is, the reciprocal of 30. The answer is 
therefore 

3. Calculate the value of 4| — 5| expressing the answer as a fraction. 

Solution 

Converting both numbers to a single fraction, we require 

14 46 _ 126 - 138 _ 12 _ 4 

~3~ ~~ ~9~ “ 27 “ ~27 ~ _ 9 

We could also have observed that the ‘lowest common multiple’ (see later) of the two 
denominators, 3 and 9, is 9; hence we could write the alternative solution 

42 46 _ 4 

9 9 9' 
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4. Remove the modulus signs from the expression | a — 2 | in the cases when (i) a is greater 
than (or equal to) 2 and (ii) a is less than 2. 

Solution 

(i) If a is greater than or equal to 2, 


| a — 2 |= a — 2; 

(ii) If a is less than 2, 

| a — 2 |= —(a — 2) = 2 — a. 

1.1.2 THE LAWS OF ALGEBRA 

If the symbols a, b and c denote numbers of arithmetic, then the following Laws are obeyed 
by them: 

(a) The Commutative Law of Addition a + b = b + a 

(b) The Associative Law of Addition a + (b + c) = (a + b) + c 

(c) The Commutative Law of Multiplication a. b = b. a 

(d) The Associative Law of Multipication a.(b.c) = ( a.b).c 

(e) The Distributive Laws a. {b + c) = a.b + a.c and (a + b).c = a.c + b.c 
Notes: 

(i) A consequence of the Distributive Laws is the rule for multiplying together a pair of 
bracketted expressions. It will be encountered more formally later, but we state it here for 
reference: 


(a + b).(c + d) — a.c + b.c + a.d + b.d 


(ii) The alphabetical letters so far used for numbers in arithmetic have been taken from the 
beginning of the alphabet. These tend to be reserved for fixed quantities called constants. 
Letters from the end of the alphabet, such as w, x, y, z are normally used for quantities 
which may take many values, and are called variables. 

1.1.3 PRIORITIES IN CALCULATIONS 

Suppose that we encountered the expression 5x6 — 4. It would seem to be ambiguous, 
meaning either 30 — 4 = 26 or 5 x 2 = 10. 
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However, we may remove the ambiguity by using brackets where necessary, together with a 
rule for precedence between the use of the brackets and the symbols +, — , x and -4. 

The rule is summarised in the abbreviation 

B.O.D.M.A.S. 

which means that the order of precedence is 


B 

brackets 

0 

First Priority 

O 

of 

X 

Joint Second Priority 

D 

division 

4- 

Joint Second Priority 

M 

multiplication 

X 

Joint Second Priority 

A 

addition 

+ 

Joint Third Priority 

S 

subtraction 

— 

Joint Third Priority 


Thus, 5 x (6 - 4) = 5 x 2 = 10 
but 5 x 6 - 4 = 30 - 4 = 26. 

Similarly, 12 4-3 — 1 = 4 — 1 = 3 
whereas 12 4- (3 — 1) = 12 4- 2 = 6. 

1.1.4 FACTORS 

If a number can be expressed as a product of other numbers, each of those other numbers is 
called a “factor” of the original number. 

EXAMPLES 

1. We may observe that 

70 = 2 x 7 x 5 

so that the number 70 has factors of 2, 7 and 5. These three cannot be broken down into 
factors themselves because they are what are known as “prime” numbers (numbers 
whose only factors are themselves and 1). Hence the only factors of 70, apart from 70 
and 1, are 2, 7 and 5. 

2. Show that the numbers 78 and 182 have two common factors which are prime numbers. 
The two factorisations are as follows: 

78 = 2 x 3 x 13, 


4 



182 = 2 x 7 x 13. 


The common factors are thus 2 and 13, both of which are prime numbers. 


Notes: 

(i) If two or more numbers have been expressed as a product of their prime factors, we may 
easily identify the prime factors which are common to all the numbers and hence obtain the 

“highest common factor”, h.c.f. 

For example, 90 = 2x3x3x5 and 108 = 2x2x3x3x3. Hence the h.c.f = 2x3x3 = 18 

(ii) If two or more numbers have been expressed as a product of their prime factors, we may 
also identify the “lowest common multiple”, l.c.m. 

For example, 15 = 3 x 5 and 20 = 2 x 2 x 5. Hence the smallest number into which both 15 
and 20 will divide requires two factors of 2 (for 20), one factor of 5 (for both 15 and 20) and 
one factor of 3 (for 15). The l.c.m. is thus 2x2x3x5 = 60. 

(iii) If the numerator and denominator of a fraction have factors in common, then such 
factors may be cancelled to leave the fraction in its “lowest terms” . 

For example ^ f 

1.1.5 EXERCISES 

1. Find the sum and product of 

(a) 3 and 6; (b) 10 and 7; (c) 2, 3 and 6; 

(d) | and (e) l| and 7|; (f) 2| and 5^-. 

2. Find the difference between and quotient of 
(a) 18 and 9; (b) 20 and 5; (c) 100 and 20; 

(d) | and (e) 3| and 2|; (f) l| and 5|. 

3. Evaluate the following expressions: 

(a) 6 - 2 x 2; (b) (6 - 2) x 2; 

(c) 6-2-2; (d) (6 4- 2) — 2; 

(e) 6 - 2 + 3 x 2; (f) 6 - (2 + 3) x 2; 

(g) (6 - 2) + 3 x 2; (h) if; (i) -f ; (j) (-6) x (-2). 
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4. Place brackets in the following to make them correct: 

(a) 6 x 12 - 3 + 1 = 55; (b) 6 x 12 - 3 + 1 = 68; 

(c) 6 x 12 - 3 + 1 = 60; (d) 5 x 4 - 3 + 2 = 7; 

(e) 5 x 4 - 3 + 2 = 15; (f) 5 x 4 - 3 + 2 = -5. 

5. Express the following as a product of prime factors: 

(a) 26; (b) 100; (c) 27; (d) 71; 

(e) 64; (f) 87; (g) 437; (h) 899. 

6. Find the h.c.f of 

(a) 12, 15 and 21; (b) 16, 24 and 40; (c) 28, 70, 120 and 160; 

(d) 35, 38 and 42; (e) 96, 120 and 144. 

7. Find the l.c.m of 

(a) 5, 6, and 8; (b) 20 and 30; (c) 7, 9 and 12; 

(d) 100, 150 and 235; (e) 96, 120 and 144. 

1.1.6 ANSWERS TO EXERCISES 


1. 

(a) 

9, 18; (b) 17,70; 

(c) 11,36; (d) §, 

'cb' 

g|5 

°|co 

217. 
20 ’ 

(f) 

154 545 

21 ’ 49 

2. 

(a) 

9,2; (b) 15,4; (c) 

80,5; (d) f: 

/ p \ 37 117. 

V 7 36’ 80 ’ 

(f) 

25 

6 

2 

’ V 

3. 

(a) 

2; (b) 8; (c) 1; (d) 1; (e) 10; 






(0 

-4; (g) 10; (h) - 

-8; (i) 8; (j) 12; 





4. 

(a) 

6 x (12 - 3) + 1 

= 55; (b) 6 x 12 

-(3 + 1) = 

68; 




(c) 

6 x (12-3 + 1) 

= 60; (d) 5 x (4 

- 3) + 2 = 7 

J 




(e) 

5 x 4 - (3 + 2) = 

= 15; (f) 5 x (4 — 

[3 + 2]) = - 

■5. 



5. 

(a) 

2 x 13; (b) 2 x 2 

! x 5 x 5; (c) 3 x 

3x3; (d) 71 x 

i; 



(e) 

2x2x2x2xi 

2 x 2; (f) 3 x 29; 

(g) 19 x 23; 

(h) 

29 : 

< 31. 

6. 

(a) 

3; (b) 8; (c) 2; (d) 1; (e) 24. 





7. 

(a) 

120; (b) 60; (c) 252; (d) 14100; (e) 1440. 
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UNIT 1.2 - - ALGEBRA 2 - NUMBERWORK 


1.2.1 TYPES OF NUMBER 


In this section (and elsewhere) the meaning of the following types of numerical quantity will 
need to be appreciated: 

(a) NATURAL NUMBERS 

These are the counting numbers 1, 2, 3, 4, 

(b) INTEGERS 

These are the positive and negative whole numbers and zero; 
i.e -5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5, 

(c) RATIONALS 

These are the numbers which can be expressed as the ratio of two integers but can also be 
written as a terminating or recurring decimal (see also next section) 


For example 


and 


2 

5 


= 0.4 


3 

- = 0.428714287142871.... 
7 


(d) IRRATIONALS 

These are the numbers which cannot be expressed as either the ratio of two integers or a 
recurring decimal (see also next section) 

Typical examples are numbers like 

v r ~ 3.1415926 

e ~ 2.71828 

V2 ~ 1.4142135.... 
y/5 ~ 2.2360679.... 

The above four types of number form the system of “real numbers” . 
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1.2.2 DECIMAL NUMBERS 


(a) Rounding to a specified number of decimal places 

Most decimal quantities used in scientific work need to be approximated by “rounding” 
them (up or down as appropriate) to a specified number of decimal places, depending on the 
accuracy required. 

When rounding to n decimal places, the digit in the n-th place is left as it is when the one 
after it is below 5; otherwise it is taken up by one digit. 

EXAMPLES 

1. 362.5863 = 362.586 to 3 decimal places; 

362.5863 = 362.59 to 2 decimal places; 

362.5863 = 362.6 to 1 decimal place; 

362.5863 = 363 to the nearest whole number. 

2. 0.02158 = 0.0216 to 4 decimal places; 

0.02158 = 0.022 to three decimal places; 

0.02158 = 0.02 to 2 decimal places. 

(b) Rounding to a specified number of significant figures 

The first significant figure of a decimal quantity is the first non-zero digit from the left, 
whether it be before or after the decimal point. 

Hence when rounding to a specified number of significant figures, we use the same principle 
as in (a), but starting from the first significant figure, then working to the right. 

EXAMPLES 

1. 362.5863 = 362.59 to 5 significant figures; 

362.5863 = 362.6 to 4 significant figures; 

362.5863 = 363 to 3 significant figures; 

362.5863 = 360 to 2 significant figures; 

362.5863 = 400 to 1 significant figure. 

2. 0.02158 = 0.0216 to 3 significant figures; 0.02158 = 0.022 to 2 significant figures; 
0.02158 = 0.02 to 1 significant figure. 
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1.2.3 THE USE OF ELECTRONIC CALCULATORS 


(a) B.O.D.M.A.S. 

The student will normally need to work to the instruction manual for the particular calculator 
being used; but care must be taken to remember the B.O.D.M.A.S. rule for priorities in 
calculations when pressing the appropriate buttons. 

For example, in working out 7.25 + 3.75 x 8.32, the multiplication should be carried out first, 
then the addition. The answer is 38.45, not 91.52. 

Similarly, in working out 6.95 + [2.43 — 1.62], it is best to evaluate 2.43 — 1.62, then generate 
its reciprocal with the ] button, then multiply by 6.95. The answer is 8.58, not 1.24 

(b) Other Useful Numerical Functions 

Other useful functions to become familiar with for scientific work with numbers are those 

_ i 

indicated by labels such as t/x, x , x y and xv , using, where necessary, the “shift” control 
to bring the correct function into operation. 

For example: 


VT73 ~ 13.153; 
173 2 = 29929; 
23 s = 12167; 
23^ ~ 2.844 


(c) The Calculator Memory 

Familiarity with the calculator’s memory facility will be essential for more complicated cal- 
culations in which various parts need to be stored temporarily while the different steps are 
being carried out. 

For example, in order to evaluate 

(1.4) 3 - 2(1. 4) 2 + 5(1.4) - 3 ~ 2.824 

we need to store each of the four terms in the calculation (positively or negatively) then 
recall their total sum at the end. 
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1.2.4 SCIENTIFIC NOTATION 


(a) Very large numbers, especially decimal numbers are customarily written in the form 

a x 10” 

where n is a positive integer and a lies between 1 and 10. 

For instance, 

521983677.103 = 5.21983677103 x 10 8 . 

(b) Very small decimal numbers are customarily written in the form 

a x HF” 

where n is a positive integer and a lies between 1 and 10. 

For instance, 

0.00045938 = 4.5938 x 10” 4 . 


Note: 

An elec tro nic c alculator will allow you to enter numbers in scientific notation by using the 
or 


EXP 


EE buttons. 


EXAMPLES 


1. Key in the number 3.90816 x 10 57 on a calculator. 

Press 3.90816 EXP 57 

In the display there will now be 3.90816 57 or 3.90816 xl ° 57 . 

2. Key in the number 1.5 x 10 -2 ' on a calculator 
Press 1.5 EXP 27 


+/- 


In the display there will now be 1.5 - 27 or 1.5 xl ° 27 . 


Notes: 

(i) On a calculator or computer, scientific notation is also called floating point notation. 

(ii) When performing a calculation involving decimal numbers, it is always a good idea to 
check that the result is reasonable and that a major arithmetical error has not been made 
with the calculator. 
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For example, 

69.845 x 196.574 = 6.9845 x 10 1 x 1.96574 x 10 3 . 

This product can be estimated for reasonableness as: 

7 x 2 x 1000 = 14000. 

The answer obtained by calculator is 13729.71 to two decimal places which is 14000 when 
rounded to the nearest 1000, indicating that the exact result could be reasonably expected. 

(iii) If a set of measurements is made with an accuracy to a given number of significant 
figures, then it may be shown that any calculation involving those measurements will be 
accurate only to one significant figure more than the least number of significant figures in 
any measurement. 

For example, the edges of a rectangular piece of cardboard are measured as 12.5cm and 
33.43cm respectively and hence the area may be evaluated as 

12.5 x 33.43 = 417.875cm 2 . 


Since one of the edges is measured only to three significant figures, the area result is accurate 
only to four significant figures and hence must be stated as 417.9cm 2 . 

1.2.5 PERCENTAGES 

Definition 

A percentage is a fraction whose denominator is 100. We use the per-cent symbol, %, to 
represent a percentage. 

For instance, the fraction W ma y be written 17% 

EXAMPLES 


1. Express | as a percentage. 

Solution 


2. Calculate 27% of 90. 

Solution 


2 _ 2 20 
5 “ 5 X 20 


40 

100 


40% 


27% of 90 


27 

loo 


x 90 


— x 9 = 24.3 
10 
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3. Express 30% as a decimal. 

Solution 


30% 


30 

loo 


0.3 


1.2.6 RATIO 

Sometimes, a more convenient way of expressing the ratio of two numbers is to use a colon 
(:) in place of either the standard division sign (-=-) or the standard notation for fractions. 

For instance, the expression 7:3 could be used instead of either 7 -f- 3 or |. It denotes that 
two quantities are “in the ratio 7 to 3” which implies that the first number is seven thirds 
times the second number or, alternatively, the second number is three sevenths times the 
first number. Although more cumbersome, the ratio 7:3 could also be written |:1 or 1:|. 

EXAMPLES 

1. Divide 170 in the ratio 3:2 

Solution 

We may consider that 170 is made up of 3 + 2 = 5 parts, each of value 1 1° = 34. 

Three of these make up a value of 3 x 34 = 102 and two of them make up a value of 
2 x 34 = 68. 

Thus 170 needs to be divided into 102 and 68. 

2. Divide 250 in the ratio 1:3:4 

Solution 

This time, we consider that 250 is made up of 1 + 3 + 4 = 8 parts, each of value 
= 31.25. Three of these make up a value of 3 x 31.25 = 93.75 and four of them 
make up a value of 4 x 31.25 = 125. 

Thus 250 needs to be divided into 31.25, 93.75 and 125. 


6 



1.2.7 EXERCISES 


1. Write to 3 s.f. 

(a) 6962; (b) 70.406; (c) 0.0123; 

(d) 0.010991; (e) 45.607; (f) 2345. 

2. Write 65.999 to 

(a) 4 s.f. (b) 3 s.f. (c) 2 s.f. 

(d) 1 s.f. (e) 2 d.p. (f) 1 d.p. 

3. Compute the following in scientific notation: 

(a) (0.003) 2 x (0.00004) x (0.00006) x 5,000,000,000; 

(b) 800 x (0.00001) 2 -T- (200, 000) 4 . 

4. Assuming that the following contain numbers obtained by measurement, use a calculator 
to determine their value and state the expected level of accuracy: 

(a) 

(13.261)°" 5 (1.2) _ 

(5.632) 3 ’ 

(b) 

(8. 342)(— 9.456) 3 
(3.25) 4 ' 

5. Calculate 23% of 124. 

6. Express the following as percentages: 

b ( b ) I; (c) S; ( d ) 1; ( e ) I- 

7. A worker earns £400 a week, then receives a 6% increase. Calculate the new weekly 
wage. 

8. Express the following percentages as decimals: 

(a) 50% (b) 36% (c) 75% (d) 100% (e) 12.5% 

9. Divide 180 in the ratio 8:1:3 

10. Divide 930 in the ratio 1:1:3 

11. Divide 6 in the ratio 2:3:4 

12. Divide 1200 in the ratio 1:2:3:4 

13. A sum of £2600 is to be divided in the ratio 2| : l| : 2|. Calculate the amount of 
money in each part of the division. 
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1.2.8. ANSWERS TO EXERCISES 


1. (a) 6960; (b) 70.4; (c) 0.0123; 

(d) 0.0110; (e) 45.6; (f) 2350. 

2. (a) 66.00; (b) 66.0; (c) 66; 

(d) 70; (e) 66.00; (f) 66.0 

3. (a) 1.08 - 04 or 1.08 xlo “ 4 (b) 5 - 29 or 5 xl0 “ 29 ; 

4. (a) 0.0245, accurate to three sig. figs, (b) —63.22, accurate to four sig. figs. 

5. 28.52 

6. (a) 81.82% (b) 75% (c) 90% (d) 90% (e) 83.33% 

7. £424. 

8. (a) 0.5; (b) 0.36; (c) 0.75; (d) 1; (e) 0.125 

9. 120, 15, 45. 

10. 186, 186, 558. 

11. 1.33, 2, 2.67 

12. 120, 240, 360, 480 

13. £1100, £600 £900. 
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UNIT 1.3 - ALGEBRA 3 - INDICES AND RADICALS (or Surds) 

1.3.1 INDICES 

(a) Positive Integer Indices 

It was seen earlier that, for any number a, a 2 denotes a. a, a 3 denotes a. a. a, a 4 denotes a. a. a. a 
and so on. 

Suppose now that a and b are arbitrary numbers and that m and n are natural numbers (i.e. 
positive whole numbers) 

Then the following rules are the basic Laws of Indices: 

Law No. 1 

a m xa n = a m+n 


Law No. 2 

a m + a n = a m ~ n 

assuming, for the moment, that m is greater than n. 

Note: 

It is natural to use this rule to give a definition to a° which would otherwise be meaningless. 

Clearly ^ — 1 but the present rule for indices suggests that ^ = a m ~ m = a 0 . 

Hence, we define a° to be equal to 1. 


Law No. 3 


(a m ) n = a mn 


a m b m = ( ab) 1 


EXAMPLE 

Simplify the expression, 


x 2 y 3 _ xy 
z z 5 ' 


Solution 

The expression becomes 


x 2 y 3 


,2 m 


x — = xy z 
z xy 
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(b) Negative Integer Indices 
Law No. 4 



a 


Note: 

It has already been mentioned that a -1 means the same as and the logic behind this 
statement is to maintain the basic Laws of Indices for negative indices as well as positive 
ones. 


For example “ +1 is clearly the same as ^ but, using Law No. 2 above, it could also be 
thought of as = a -1 . 

Law No. 5 


Note: 

This time, we may observe that is clearly the same as but we could also use Law 
No. 2 to interpret it as a m ~^ n+n 1 = a~ n 


Law No. 6 


a~°° = 0 


Note: 

Strictly speaking, no power of a number can ever be equal to zero, but Law No. 6 asserts that 
a very large negative power of a number a gives a very small value; the larger the negative 
power, the smaller will be the value. 

EXAMPLE 

Simplify the expression, 

x 5 y 2 z ~ 3 _ z 2 x 2 
X _ 1 y A Z b ' y~ l 


Solution 


The expression becomes 



(c) Rational Indices 

(i) Indices of the form - where n is a natural number. 

In order to preserve Law No. 3, we interpret a« to mean a number which gives the value a 
when it is raised to the power n. It is called an “n-th Root of a” and, sometimes there is 
more than one value. 

ILLUSTRATION 


81 4 = ±3 but (—27)3 = —3 only. 

(ii) Indices of the form — where m and n are natural numbers with no common 

v ' n 

factor. 

The expression may be interpreted in two ways as either (; y m )« or (|/™) m . It may be shown 
that both interpretations give the same result but, sometimes, the arithmetic is shorter with 
one rather than the other. 

ILLUSTRATION 


275 = 3 2 = 9 or 275 = 7295 = 9. 


Note: 

It may be shown that all of the standard laws of indices may be used for fractional indices. 

1.3.2 RADICALS (or Surds) 

The symbol is called a “radical” (or “surd”). It is used to indicate the positive or 
“principal” square root of a number. Thus \/l6 = 4 and v^25 = 5. 

The number under the radical is called the “radicand” . 

Most of our work on radicals will deal with square roots, but we may have occasion to use 
other roots of a number. For instance the principal n-th root of a number a is denoted by 
n \fa 1 and is a number x such that x n = a. The number n is called the index of the radical 
but, of course, when n — 2 we usually leave the index out. 
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ILLUSTRATIONS 


1. 3 v / 64 = 4 since 4 3 = 64. 

2. 3 ^/— 64 = —4 since (— 4) 3 = —64. 

3. 4 V / 8T = 3 since 3 4 = 81. 

4. 5 VS2 = 2 since 2 5 = 32. 

5. 5 v /= 32 = -2 since (-2) 5 = -32. 

Note: 

If the index of the radical is an odd number, then the radicand may be positive or negative; 
but if the index of the radical is an even number, then the radicand may not be negative 
since no even power of a negative number will ever give a negative result. 

(a) Rules for Square Roots 

In preparation for work which will follow in the next section, we list here the standard rules 
for square roots: 

(i) (v^) 2 = « 

(ii) y/tf = | a | 

(iii) Vab = sj~a\fb 



assuming that all of the radicals can be evaluated. 

ILLUSTRATIONS 

1. a/9 x 4 = v^36 = 6 and y/9 x \/4 = 3x2 = 6. 
2- \/¥=v / 4 = 2and^ = f = 2. 
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(b) Rationalisation of Radical (or Surd) Expressions. 

It is often desirable to eliminate expressions containing radicals from the denominator of a 
quotient. This process is called 

rationalising the denominator. 

The process involves multiplying numerator and denominator of the quotient by the same 
amount - an amount which eliminates the radicals in the denominator (often using the fact 
that the square root of a number multiplied by itself gives just the number; 

i.e. y/a.y/a, = a). We illustrate with examples: 

EXAMPLES 

1. Rationalise the surd form 

Solution 

We simply multiply numerator and denominator by \/3 to give 

5 _ 5 s a/3 _ 5 a/3 

4\/3 ~~ 4^3 X v 7 ! _ 12 - ' 

2. Rationalise the surd form 3 /= 

Solution 

Here we observe that, if we can convert the denominator into the cube root of b n , where 
n is a whole multiple of 3, then the square root sign will disappear. 

We have 

3 y/a 3 y Hx 3 \/h 2 3 y/ab 2 3 y/ab 2 

VS ~~ VS X VP ~~ 3 Vb 3 - b 

If the denominator is of the form sfa + y/b, we multiply the numerator and the denom- 
inator by the expression v / a — y/b because 

(y/a + y/b) (y/a - y/b) = a - b. 

3. Rationalise the surd form 

Solution 

Multiplying numerator and denominator by a/5 — y/2 gives 

4 w y/b- y/2 _ 4y/b-4y/2 
y/b + y/2 X y/b- y/2 - 3 ' 
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4. Rationalise the surd form 


a/3-1' 


Solution 

Multiplying numerator and denominator by \/3 + 1 gives 

v^+l v^+l 


X 


V3-1 v^+l 


(c) Changing numbers to 

The modulus of any number 
a m ; i.e. 


and from radical form 

of the form a ^ can be regarded as the principal n-th root of 



If a number of the type shown on the left is converted to the type on the right, we are said 
to have expressed it in radical form. 

If a number of the type on the right is converted to the type on the left, we are said to have 
expressed it in exponential form. 

Note: 

The word “exponent” is just another word for “power” or “index” and the standard 
rules of indices will need to be used in questions of the type discussed here. 

EXAMPLES 


2 

1. Express the number is in radical form. 

Solution 

The answer is just 

2. Express the number 2 3 \/a 5 b 4 in exponential form. 

Solution 

Here we have 

3 v / ^i= (a 5 6 4 )5 =aM. 
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1.3.3 EXERCISES 


1. Simplify 

(a) 5 7 x 5 13 ; (b) 9 8 x 9 5 ; (c) ll 2 x ll 3 x ll 4 . 

2. Simplify 

(a) Sg; (b) % ; (c) g. 

3. Simplify 

(a) a 7 a 3 ' (b) a 4 a 5 ; 

(c) b ll b w b] (d) 3a; 6 x 5a; 9 . 

4. Simplify 

(a) (7 3 ) 2 ; (b) (4 2 )*; (c) (7 s ) 2 . 

5. Simplify 

(a) ( x 2 y 3 )(x 3 y 2 ); (b) (2a; 2 ) (3a; 4 ); 

(c) (a 2 bc 2 )(b 2 ca)] (d) f^r- 

6. Simplify 

(a) (4^ 3 ) 2 (b) a 13 a -2 ; 

(c) x~ 9 x~ 7 ] (d) x~ 2l x 2 x ; 

(e) ^ -r- -rgs. 

\ j z ° x 

7. Without using a calculator, evaluate the following: 

(a) (b) g. 

8. Evaluate the following: 

(a) 64^; (b) 1445; 

(c) 16“5; (d) 25“5; 

(e) 165; (f) 125 _ i. 


9. Simplify the following radicals: 

(a) - 3 v /3 8; (b) V36a; 4 ; (c) y/ljjjp. 

10. Rationalise the following surd forms: 

(a) # (b) W. M (d) 


11. Change the following to exponential form: 
(a) 4 V7 2 ] (b) 5 V a 2 b ; (c) 3 v / 9 6 . 
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12. Change the following to radical form: 

(a) &i; (b) r§; (c) si. 

1.3.4 ANSWERS TO EXERCISES 

1. (a) 5 20 ; (b) 9 13 ; (c) ll 9 . 

2. (a) 15; (b) 4 9 ; (c) 5. 

3. (a) a 10 ; (b) a 9 ; (c) 6 22 ; (d) 15a; 15 . 

4. (a) 7 6 ; (b) 4 16 ; (c) 7 18 . 

5. (a) x 5 y 5 ; (b) 6a; 6 ; (c) a 3 b 3 c 3 \ (d) 2cd. 

6. (a) 4 -6 ; (b) a 11 ; (c) a; -16 ; (d) a;” 18 ; (e) xy 2 z~ 5 . 

7. (a) i; (b) 27. 

8. (a) 4; (b) ±12; (c) ±±; 

(d) ±|; (e) ±64; (f) 

9. (a) 2; (b) 6a; 2 ; (c) \f b . 

10. (a) % (b) ^ = 3 V9] (c) — (2 ± v / 5)(2 ± a/ 3); (d) 

11. (a) |7s|; (b) ahl- (c) 9§. 

12. (a) 5 VP] (b) 3 a/t^; (c) 3 \/sh 
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UNIT 1.4 - ALGEBRA 4 - LOGARITHMS 


1.4.1 COMMON LOGARITHMS 

The system of numbers with which we normally count and calculate has a base of 10; this 
means that each of the successive digits of a particular number correspond to that digit 
multiplied by a certain power of 10. 

For example 

73, 520 = 7 x 10 4 + 3 x 10 3 + 5 x 10 2 + 2 x 10 1 . 


Note: 

Other systems (not discussed here) are sometimes used - such as the binary system which 
uses successive powers of 2. 

The question now arises as to whether a given number can be expressed as a single power 
of 10, not necessarily an integer power. It will certainly need to be a positive number since 
powers of 10 are not normally negative (or even zero). 

It can easily be verified by calculator, for instance that 

1.99526 ~ IQ 0 3 


and 


2 ~ ^gO. 30103 


DEFINITION 

In general, when it occurs that 

x = 10 y , 

for some positive number x , we say that y is the “logarithm to base 10” of x 
(or “ common logarithm” of x) and we write 

y = logio x. 


EXAMPLES 

1. log 10 1.99526 = 0.3 from the illustrations above. 

2. log 10 2 = 0.30103 from the illustrations above. 

3. log 10 1 = 0 simply because 10 n = 1. 
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1.4.2 LOGARITHMS IN GENERAL 


In practice, with scientific work, only two bases of logarithms are ever used; but it will be 
useful to include here a general discussion of the definition and properties of logarithms to 
any base so that unnecessary repetition may be avoided. We consider only positive bases of 
logarithms in the general discussion. 

DEFINITION 

If B is a fixed positive number and x is another positive number such that 

x = B y , 

we say that y is the “logarithm to base R” of x and we write 

y = i°gs x. 


EXAMPLES 

1. log B 1 = 0 simply because R° = 1. 

2. log B B = 1 simply because B 1 = B. 

3. log B 0 doesn’t really exist because no power of B could ever be equal to zero. But, since 
a very large negative power of B will be a very small positive number, we usually write 

logs 0 = -oo. 


1.4.3 USEFUL RESULTS 


In preparation for the general properties of logarithms, we note the following two results 
which can be obtained directly from the definition of a logarithm: 

(a) For any positive number x, 

x = B 1 ° SbX . 

In other words, any positive number can be expressed as a power of B without necessarily 
using a calculator. 


We have simply replaced the y in the statement x = B y by log B x in the equivalent statement 
y = log B x. 


(b) For any number y, 


V = log B B v . 
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In other words, any number can be expressed in the form of a logarithm without necessarily 
using a calculator. 

We have simply replaced x in the statement y = log s x by B y in the equivalent statement 
x = B y . 

1.4.4 PROPERTIES OF LOGARITHMS 

The following properties were once necessary for performing numerical calculations before 
electronic calculators came into use. We do not use logarithms for this purpose nowadays; 
but we do need their properties for various topics in scientific mathematics. 

(a) The Logarithm of Product. 

log B p-q = log B p + log B q. 


Proof: 

We need to show that, when p.q is expressed as a power of B , that power is the expression 
on the right hand side of the above formula. 

From Result (a) of the previous section, 

p q — J^ogg p p>log B q _ J^logg p+log B q 

by elementary properties of indices. 

The result therefore follows. 


(b) The Logarithm of a Quotient 

logs 


P 

q 


logs p - logs q- 


Proof: 

The proof is along similar lines to that in (i). 
From Result (a) of the previous section, 


P 

q 


B 1 °s b p 

_ _gl oggP~log B q 

B 1o Sb <1 


by elementary properties of indices. 
The result therefore follows. 
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(c) The Logarithm of an Exponential 

log B p n = n\og B p, 

where n need not be an integer. 

Proof: 

From Result (a) of the previous section, 

pH — ^ ^log s " _ gn log B p 

by elementary properties of indices. 

(d) The Logarithm of a Reciprocal 

logs - = - logs q- 

q 


Proof: 

This property may be proved in two ways as follows: 

Method 1. 

The left-hand side = log B 1 — log B q = 0 — log B q = — log B q. 

Method 2. 

The left-hand side = log B q~ l = — log B q. 


(e) Change of Base 


logs x 


logn^ 
logA B ' 


Proof: 

Suppose y = log B x, then x = B y and hence 

log^ x = log A B y = y log A B. 


Thus, 

log A x 
V ~ logn B 

and the result follows. 


Note: 

The result shows that the logarithms of any set of numbers to a given base will be directly 
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proportional to the logarithms of the same set of numbers to another given base. This is 
simply because the number log^ B is a constant. 

1.4.5 NATURAL LOGARITHMS 


It was mentioned earlier that, in scientific work, only two bases of logarithms are ever used. 
One of these is base 10 and the other is a base which arises naturally out of elementary 
calculus when discussing the simplest available result for the “derivative” (rate of change) 
of a logarithm. 

This other base turns out to be a non-recurring, non-terminating decimal quantity (irrational 

number) which is equal to 2.71828 and clearly this would be inconvenient to write into 

the logarithm notation. 


We therefore denote it by e to give the “natural logarithm” of a number, x, in the form 
log e a;, although most scientific books use the alternative notation In an 


Note: 

From the earlier change of base formula we can say that 


log 10 x 


log e a; 
log e 10 


and 


log e x 


logio x 
logio e ' 


EXAMPLES 


1. Solve for x the indicial equation 

q 3x-2 _ 2Q x+1 


Solution 

The secret of solving an equation where an unknown quantity appears in a power (or 
index or exponent) is to take logarithms of both sides first. 

Here we obtain 


(3a; - 2) log 10 4 = (x + 1) log 10 26; 


(3a; - 2)0.6021 = 
1.8063a; - 1.2042 = 
(1.8603 - 1.4150)a; = 

0.3913a; = 
2.6192 
X ~ 0.3913 


(x + 1)1.4150; 
1.4150a; + 1.4150; 
= 1.4150 + 1.2042; 
2.6192; 

~ 6.6936 


5 



2. Rewrite the expression 


Ax + log 10 (x + 1) - log 10 x - - log 10 (a: 3 + 2x 2 - x) 


as the common logarithm of a single mathematical expression. 

Solution 

The secret here is to make sure that every term in the given expression is converted, 
where necessary, to a logarithm with no multiple in front of it or behind it. In this case, 
we need first to write Ax = log 10 10 4x and \ log 10 (a; 3 + 2x 2 — x) = log 10 ( x 3 + 2x 2 — x)*. 

We can then use the results for the logarithms of a product and a quotient to give 


log 


10 4x (a; + 1) 

xJ(x 3 + 2x 2 — x) 


3. Rewrite without logarithms the equation 

2x + In x — ln(x — 7). 


Solution 

This time, we need to convert both sides to the natural logarithm of a single mathe- 
matical expression in order to remove the logarithms completely. 

2x + In x = In e 2x + In x = In xe 2x . 


Hence, 


xe 2x = x — 7. 


A. Solve for x the equation 


6 In 4 + In 2 = 3 + In x. 


Solution 

In view of the facts that 6 In 4 = ln4 6 and 3 = lne 3 , the equation can be written 

In 2(4 6 ) = In xe 3 . 


Hence, 
so that 


2(4 6 ) = xe 3 , 



~ 407.856 
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1.4.6 GRAPHS OF LOGARITHMIC AND EXPONENTIAL FUNCTIONS 


In the applications of mathematics to science and engineering, two commonly used “func- 
tions” are y = e x and y = log e x. Their graphs are as follows: 




They are closely linked with each other by virtue of the two equivalent statements: 

P = log e Q and Q = e p 
for any number P and any postive number Q. 

Because of these statements, we would expect similarities in the graphs of the equations 

y = log e x and y = e x . 
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1.4.7 LOGARITHMIC SCALES 


In a certain kind of graphical work (see Unit 5.3), some use is made of a linear scale along 
which numbers can be allocated according to their logarithmic distances from a chosen origin 
of measurement. 

Considering firstly that 10 is the base of logarithms, the number 1 is always placed at the 
zero of measurement (since log 10 1 = 0); 10 is placed at the first unit of measurement (since 
log 10 10 = 1), 100 is placed at the second unit of measurement (since log 10 100 = 2), and so 
on. 

Negative powers of 10 such as lO^ 1 = 0.1, 10~ 2 = 0.01 etc. are placed at the points 
corresponding to —1 and —2 etc. respectively on an ordinary linear scale. 

The logarithmic scale appears therefore in “cycles” , each cycle corresponding to a range of 
numbers between two consecutive powers of 10. 

Intermediate numbers are placed at intervals which correspond to their logarithm values. 
An extract from a typical logarithmic scale would be as follows: 

0.1 0.2 0.3 0.4 1 2 3 4 10 


Notes: 

(i) A given set of numbers will determine how many cycles are required on the logarithmic 
scale. For example .3, .6, 5, 9, 23, 42, 166 will require four cycles. 

(ii) Commercially printed logarithmic scales do not specify the base of logarithms; the change 
of base formula implies that logarithms to different bases are proportional to each other and 
hence their logarithmic scales will have the same relative shape. 



1.4.8 EXERCISES 


1. Without using tables or a calculator, evaluate 

(a) log 10 27 -T- log 10 3; 

0) (login 16 - logm 2 ) + lo Sio 2 - 

2. Using properties of logarithms where possible, solve for x the following equations: 

(a) log 10 l + 2 log 10 | - log 10 ^ = 1 + log 10 x \ 

(b) 2 log 10 6 - (log 10 4 + log 10 9) = log 10 x. 

(c) 10* = 5(2 10 ). 

3. From the definition of a logarithm or the change of base formula, evaluate the following: 

(a) log 2 7; 

(b) log 3 0.04; 

(c) log 5 3; 

(d) 3 log 3 2 — log 3 4 + log 3 \ . 

4. Obtain y in terms of x for the following equations: 

(a) 2 In y = 1 — x 2 ] 

(b) In x — 5 — 3 In y. 

5. Rewrite the following statements without logarithms: 

(a) In x — — \ ln(l — 2u 3 ) + In C; 

(b) ln(l + y) = \x 2 + In 4; 

(c) ln(4 + y 2 ) = 21n(a; + 1) + In A. 

6. (a) If y = 10 ac , find c in terms of the other quantities in the formula. 

(b) If y p = Cx q , find q in terms of the other quantities in the formula. 
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1.4.9 ANSWERS TO EXERCISES 


1. (a) 3; (b) 3. 

2. (a) 4; (b) 1; (c) 3.70927 

3. (a) 2.807; (b) -2.930; (c) 0.683; (d) 0 

4. (a) 

(b) 


y = 



5. (a) 


(b) 


(c) 


C 

~~ Vl - 2v 3 ’ 


1 + y = 4e "2 ; 


4 + y 2 = A(x + l) 2 


6. (a) 


(b) 


using any base. 


c = 



h 

r 


p log y — log C 
logo; 
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UNIT 1.5 - ALGEBRA 5 

MANIPULATION OF ALGEBRAIC EXPRESSIONS 
1.5.1 SIMPLIFICATION OF EXPRESSIONS 

An algebraic expression will, in general, contain a mixture of alphabetical symbols together 
with one or more numerical quantities; some of these symbols and numbers may be bracketted 
together. 

Using the Language of Algebra and the Laws of Algebra discussed earlier, the method of 
simplification is to remove brackets and collect together any terms which have the same 
format 

Some elementary illustrations are as follows: 

1. oAoAg[A3A5A&A5A&A8 = 3 o, A 4 b 11. 

2. lip 2 + 5 q 7 — 8 p 2 A q 7 = 3 p 2 + 6 q 7 . 

3. a(2a — b) A b(a A 5b) — a 2 — 46 2 = 2a 2 — ab A ba A 5 b 2 — a 2 — 46 2 = a 2 + b 2 . 

More frequently, the expressions to be simplified will involve symbols which represent both 
the constants and variables encountered in scientific work. Typical examples in pure math- 
ematics use symbols like x, y and z to represent the variable quantities. 

Further illustrations use this kind of notation and, for simplicity, we shall omit the full-stop 
type of multiplication sign between symbols. 

1. x(2x + 5) + a; 2 (3 — x) = 2x 2 + 5a: + 3a; 2 — x 3 = 5x 2 + 5a; — x 3 . 

2. a; _1 (4a; — x 2 ) — 6(1 — 3x) = 4 — x — 6 + 18a; = 17a; — 2. 

We need also to consider the kind of expression which involves two or more brackets multi- 
plied together; but the routine is just an extension of what has already been discussed. 

For example consider the expression 

(o A 6)(c A d). 


Taking the first bracket as a single item for the moment, the Distributive Law gives 


(o A b)c A (o A b)d. 
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Using the Distributive Law a second time gives 


ac + be + ad + bd. 


In other words, each of the two terms in the first bracket are multiplied by each of the two 
terms in the second bracket, giving four terms in all. 

Again, we illustrate with examples: 

EXAMPLES 

1. (x + 3)(x — 5) = x 2 + 3a: — 5a; — 15 = x 2 — 2x — 15. 

2. (a: 3 — x)(x + 5) = a; 4 — x 2 + 5a; 3 — 5x. 

3. {x + a) 2 = (x + a)(x + a) = x 2 + ax + ax + a 2 = x 2 + 2 ax + a 2 . 

4. {x + a) (x — a) = x 2 + ax — ax — a 2 = x 2 — a 2 . 

The last two illustrations above are significant for later work because they incorporate, 
respectively, the standard results for a “perfect square” and “the difference between 
two squares”. 

1.5.2 FACTORISATION 

Introduction 

In an algebraic context, the word “factor” means the same as “multiplier”. Thus, to 
factorise an algebraic expression, is to write it as a product of separate multipliers or factors. 

Some simple examples will serve to introduce the idea: 

EXAMPLES 

1. 

3a; + 12 = 3(a; + 4). 

2. 

8x 2 — 12x = a;(8a; — 12) = 4a;(2a; — 3). 

3. 


5a; 2 + 15a; 3 = a; 2 (5 + 15a;) = 5a; 2 (1 + 3a;). 
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4 . 


6x + 3a: 2 + 9 xy = x(6 + 3a: + 9 y) = 3a:(2 + x + 3 y). 


Note: 

When none of the factors can be broken down into simpler factors, the original expression 
is said to have been factorised into “irreducible factors” . 

Factorisation of quadratic expressions 

A “quadratic expression” is an expression of the form 

ax 2 + bx + c, 

where, usually, a, b and c are fixed numbers (constants) while a: is a variable number. The 
numbers a and b are called, respectively, the “coefficients” of x 2 and x while c is called 
the “constant term”; but, for brevity, we often say that the quadratic expression has 
coefficients a, b and c. 

Note: 

ft is important that the coefficient a does not have the value zero otherwise the expression 
is not quadratic but “linear”. 

The method of factorisation is illustrated by examples: 

(a) When the coefficient of x 2 is 1 
EXAMPLES 

1 . 


x 2 + 5a: + 6 = (x + m) (x + n) = x 2 + (m + n)x + mn. 

This implies that 5 = m + n and 6 = mn which, by inspection gives m — 2 and n = 3. 
ffence 

x 2 + 5a: + 6 = {x + 2){x + 3). 

2 . 

x 2 + 4x — 21 = (x + m)(x + n ) = x 2 + (m + n)x + mn. 

This implies that 4 = m + n and —21 = mn which, by inspection, gives m = —3 and 
n = 7. Hence 

x 2 + Ax — 21 = (x — 3)(x + 7). 
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Notes: 

(i) In general, for simple cases, it is better to try to carry out the factorisation entirely 
by inspection. This avoids the cumbersome use of m and n in the above two examples as 
follows: 


x 2 + 2x — 8 = (x+?)(x+?). 


The two missing numbers must be such that their sum is 2 and their product is —8. The 
required values are therefore —2 and 4. Hence 

x 2 + 2x — 8 = (x — 2){x + 4). 


(ii) It is necessary, when factorising a quadratic expression, to be aware that either a perfect 
square or the difference of two squares might be involved. In these cases, the factorisation 
is a little simpler. For instance: 

x 2 + 10a; + 25 = (x + 5) 2 

and 

x 2 — 64 = (x — 8)(x + 8). 

(iii) Some quadratic expressions will not conveniently factorise at all. For example, in the 
expression 


x 2 — 13a; + 2, 

we cannot find two whole numbers whose sum is —13 while, at the same time, their product 
is 2. 

(b) When the coefficient of x 2 is not 1 

Quadratic expressions of this kind are usually more difficult to factorise than those in the 
previous paragraph. We first need to determine the possible pairs of factors of the coefficient 
of x 2 and the possible pairs of factors of the constant term; then we need to consider the 
possible combinations of these which provide the correct factors of the quadratic expression. 

EXAMPLES 

1. To factorise the expression 

2x 2 + 11a; + 12, 
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we observe that 2 is the product of 2 and 1 only, while 12 is the product of either 12 and 
1, 6 and 2 or 4 and 3. All terms of the quadratic expression are positive and hence we 
may try (2a; + 1) (a; + 12), (2x + 12)(x + l), (2x + 6 ) (x + 2), (2x + 2 ) (x + 6), (2a; + 4)(a; + 3) 
and (2x + 3) (a; + 4) . Only the last of these is correct and so 

2a; 2 + 11a; + 12 = (2a; + 3)(a; + 4). 

2. To factorise the expression 

6a; 2 + 7x — 3, 

we observe that 6 is the product of either 6 and 1 or 3 and 2 while 3 is the product 
of 3 and 1 only. A negative constant term implies that, in the final result, its two 
factors must have opposite signs. Hence we may try (6a; + 3) (a; — 1), (6a; — 3) (a; + 1), 
(6a; + l)(a; - 3), (6a; - l)(a; + 3), (3a; + 3) (2a; - 1), (3a; - 3) (2a; + 1), (3a; + l)(2a; - 3) 
and (3a; — l)(2a; + 3). Again, only the last of these is correct and so 

6a; 2 + 7x — 3 = (3a; — l)(2x + 3). 


Note: 

The more factors there are in the coefficients considered, the more possibilities there are to 
try of the final factorisation. 


1.5.3 COMPLETING THE SQUARE IN A QUADRATIC EXPRESSION 


The following work is based on the standard expansions 

(x + a) 2 = x 2 + 2 ax + a 2 


and 


(x — a) 2 = x 2 — 2 ax + a 2 . 


Both of these last expressions are called “complete squares” (or “perfect squares”) in 
which we observe that one half of the coefficient of x , when multiplied by itself, gives the 
constant term. That is 

(^ x 2a) 2 = a 2 . 

ILLUSTRATIONS 

1 . 


x 2 + 6a; + 9 = (x + 3) 2 . 
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2 . 

x 2 — 8x + 16 = (x — 4) 2 . 

3. 

4a; 2 — 4x + 1 = 4 x 2 — x + - = 4^3; — -^ . 

L 4j V 2) 

Of course it may happen that a given quadratic expression is NOT a complete square; but, 
by using one half of the coefficient of x, we may express it as the sum or difference of a 
complete square and a constant. This process is called “completing the square”, and the 
following examples illustrate it: 

EXAMPLES 

1. 

x 2 + 6a; + 11 = (x + 3) 2 + 2. 

2. 

x 2 — 8x + 7 = (x — 4) 2 — 9. 

3. 


r 5’ 

4a; 2 — 4a; + 5 = 4 x 2 — x+- 



1.5.4 ALGEBRAIC FRACTIONS 

We first recall the basic rules for combining fractions, namely 


b d 


ad ± be 
bd 


a c ac a c 

b d bd' b d 


ad 
be ’ 
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We also note that a single algebraic fraction may sometimes be simplified by the cancellation 
of common factors between the numerator and the denominator. 

EXAMPLES 


1 . 


2 . 


3. 


25 + 15a 5 + 3a: 


, assuming that x ^ 


4 x 4 . ^ 1 

— = , assuming that x A 0 or . 

3x 2 + x 3x+V 6 3 


x + 2 _ x + 2 

x 2 + 3x + 2 (x + 2) (x + 1) 


, assuming that x A — 1 or —2. 

x + 1 


These elementary principles may now be used with more advanced combinations of algebraic 
fractions 

EXAMPLES 


1. Simplify the expression 

3a: + 6 x + 1 
x 2 + 3x + 2 2x + 8 ’ 

Solution 

Using factorisation where possible, together with the rule for multiplying fractions, we 
obtain 

3 (a: + 2)(a: + 1) _ 3 

2(x + 4:)(x + l)(x + 2) = 2(x + 4) ’ 
assuming that x ^ —1, —2 or —4. 

2. Simplify the expression 

3 _ x 

x + 2 ' 2a: + 4 ’ 

Solution 

Using factorisation where possible together with the rule for dividing fractions, we 
obtain 

3 2a: + 4 3 2(a: + 2) 6 

x = x — = — , 

a: + 2 x x + 2 x x 

assuming that x ^ 0 or —2. 
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3. Express 

4 _ 3 

x + y y 

as a single fraction. 

Solution 

From the basic rule for adding and subtracting fractions, we obtain 

Ay — ?j(x + y) _ y — 3x 
(x + y)y ~ (x + y)y : 

assuming that y ^ 0 and x ^ —y. 

4. Express 

x 4 — x 2 
a; + 1 a; 2 — x — 2 

as a single fraction. 

Solution 

This example could be tackled in the same way as the previous one but it is worth 
noticing that x 2 — x — 2 = (x + l)(a; — 2). Consequently, it is worth putting both 
fractions over the simplest common denominator, namely (x + l)(a; — 2). Hence we 
obtain, if x ^ 2 or — 1, 

x(x — 2) 4 — x 2 _ x 2 — 2x + 4 — x 2 _ 2(2 — x) _ 2 

(x + l)(x — 2) (x + l)(x — 2) (x + l)(x — 2) (x + l)(x — 2) x+1’ 

1.5.5 EXERCISES 

1. Write down in their simplest forms 

(a) 5a — 2b — 3a + 66; (b) 11 p + 5q — 2q + p. 

2. Simplify the following expressions: 

(a) 3a; 2 — 2a; + 5 — x 2 + 7x — 2; (b) x 3 + 5a; 2 — 2a; + 1 + x — x 2 . 

3. Expand and simplify the following expressions: 

(a) x(x 2 — 3x) + a; 2 (4a; + 7); (b) (2a; — l)(2a; + 1) — x 2 + 5a;; 

(c) (a; + 3) (2a; 2 - 5); (d) 2(3a; + l) 2 + 5(a; - 7) 2 . 

4. Factorise the following expressions: 

(a) xy + Ax 2 y] (b) 2 abc — 6 ah 2 ; 

(c) nr 2 + 2nrh] (d) 2 xy 2 z + Ax 2 z. 
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5. Factorise the following quadratic expressions: 

(a) x 2 + 8x + 12; (b) x 2 + llx + 18; (c) x 2 + 13x — 30; 

(d) 3a; 2 + 11a; + 6; (e) 4a; 2 — 12x + 9; (f) 9a; 2 — 64. 

6. Complete the square in the following quadratic expressions: 
(a) x 2 — 10a; — 26; (b) x 2 — 5x + 4; (c) 7x 2 — 2x + 1. 


7. Simplify the following: 


a ,2 +4.x+4 . 


\ a> x 2 +5x+6’ a; 2 +2a;+l , 

assuming the values of x to be such that no denominators are zero. 


x 2 —l 


8. Express each of the following as a single fraction: 

(a) I + (b) -x - b 

+ x+1 7+2’ ^ a ,2 +5;r+4 ~ aj+4> 

assuming that the values of x and y are such that no denominators are zero. 


1.5.6 ANSWERS TO EXERCISES 


1. (a) 2 a + 46; (b) 12p + 3 q. 

2. (a) 2x 2 + 5a; + 3; (b) x 3 + 4a; 2 — x+1. 

3. (a) 5a; 3 + 4a; 2 ; (b) 3a; 2 + 5a; — 1. 

(c) 2a; 3 + 6a; 2 - 5a; - 15; (d) 23a; 2 - 58a; + 247. 

4. (a) xy( 1 + 4a;); (b) 2 ab(c — 36); 

(c) 7T r(r + 2 h); (d) 2 xz(y 2 + 2x). 


5. (a) (x + 2) (a; + 6); (b) (x + 2) (a; + 9); (c) (x — 2)(x + 15; 
(d) (3a; + 2)(x + 3); (e) (2a; - 3) 2 ; (f) (3x - 8)(3x + 8). 


6. (a) (x — 5) 2 — 51; (b) (x - |) 2 


(c) 


7 


x 


w+ 


_ 6 _' 
49. ’ 


7. (a) 


x ~|~2 . 
x-\-3 1 


(b) 


X— 1 

X+1 * 


(b) 


3y+4x 
xy 
2z+3 . 

(x+l)(x+2) ’ 


(d) 


2x — 3 
a; 2 +5a;+4‘ 
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UNIT 1.6 - ALGEBRA 6 - FORMULAE AND ALGEBRAIC EQUATIONS 
1.6.1 TRANSPOSITION OF FORMULAE 


In dealing with technical formulae, it is often required to single out one of the quantities 
involved in terms of all the others. We are said to “transpose the formula” and make 
that quantity “the subject of the equation”. 

In order to do this, steps of the following types may be carried out on both sides of a given 
formula: 

(a) Addition or subtraction of the same value; 

(b) Multiplication or division by the same value; 

(c) The raising of both sides to equal powers; 

(d) Taking logarithms of both sides. 

EXAMPLES 

1. Make x the subject of the formula 

y = 3(x + 7). 

Solution 

Dividing both sides by 3 gives | = x + 7; then subtracting 7 gives x — | — 7. 

2. Make y the subject of the formula 

a = b + c\Jx 2 — y 2 . 

Solution 

(i) Subtracting b gives a — b = c\Jx 2 — y 2 ; 

(ii) Dividing by c gives — = yjx 2 — y 2 ; 

(iii) Squaring both sides gives = x 2 — y 2 ; 

(iv) Subtracting x 2 gives (^) — x 2 = — y 2 \ 

(v) Multiplying throughout by —1 gives x 2 — ( a ~ h ') = V 2 ] 
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(vi) Taking square roots of both sides gives 


y 


= ± 


\ 


x * 



2 


3. Make x the subject of the formula 


e 2 "- 1 



Solution 

Taking natural logarithms of both sides of the formula 

2x — 1 = 3 In y. 


Hence 


x = 


3 In y + 1 
2 


Note: 

A genuine scientific formula will usually involve quantities which can assume only positive 
values; in which case we can ignore the negative value of a square root. 

1.6.2 SOLUTION OF LINEAR EQUATIONS 

A Linear Equation in a variable quantity x has the general form 

ax + b = c. 

Its solution is obtained by first subtracting b from both sides then dividing both sides by a. 
That is 

c — b 

x = . 

a 


EXAMPLES 


1. Solve the equation 


5x + 11 = 20. 


Solution 

The solution is clearly x = 


= 1.8 


5 


9 

5 
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2. Solve the equation 

3 - 7x = 12. 

Solution 

This time, the solution is x = ~ —1.29 

1.6.3 SOLUTION OF QUADRATIC EQUATIONS 

The standard form of a quadratic equation is 

ax 2 + bx + c = 0, 


where a, b and c are constants and a ^ 0. 

We shall discuss three methods of solving such an equation related very closely to the previous 
discussion on quadratic expressions. The first two methods can be illustrated by examples. 

(a) By Factorisation 

This method depends on the ability to determine the factors of the left hand side of the 
given quadratic equation. This will usually be by trial and error. 

EXAMPLES 

1. Solve the quadratic equation 

6a: 2 + x — 2 = 0. 


Solution 

In factorised form, the equation can be written 

(3a; + 2) (2a;- 1) = 0. 

Hence, x — — | or x — 

2. Solve the quadratic equation 

15a; 2 - 17a; - 4 = 0. 


Solution 

In factorised form, the equation can be written 

(5a; + l)(3x — 4) = 0. 

Hence, x = — |ora; = | 
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(b) By Completing the square 


By looking at some numerical examples of this method, we shall be led naturally to a third 
method involving a universal formula for solving any quadratic equation. 

EXAMPLES 

1. Solve the quadratic equation 

x 2 — Ax — 1 = 0. 


Solution 

On completing the square, the equation can be written 

(x — 2) 2 — 5 = 0. 


Thus, 


x- t i = ±VE. 


That is, 


x = 2 ± y/E. 


Left as it is, this is an answer in “surd form” but it could, of course, be expressed in 
decimals as 4.236 and —0.236. 

2. Solve the quadratic equation 

4a; 2 + 4a; — 2 = 0. 


Solution 

The equation may be written 


and, on completing the square, 


x" + x - - 


0 


4 



3 

4 


0 . 


Hence, 



giving 


1 


3 



That is, 



— 1 ± y/Z 


(c) By the Quadratic Formula 

Starting now with an arbitrary quadratic equation 

ax 2 + bx + c = 0, 

we shall use the method of completing the square in order to establish the general solution. 


The sequence of steps is as follows: 


, b c 
x H — x H — 
a a 


= 0 ; 


( b\ b 2 c 
X+ 2a ~4^ + a 


= 0 ; 


( b\ 2 b 2 c 

[ X+ Ta) ~ ~ a’ 

b rp c 
X + 2a = ± V 4^ _ a’ 

b b 2 — 4ac 

X = ~2 o ± V 4a 2 ’ 

—b ± y/b 2 — 4 ac 


Note: 

The quantity b 2 — 4 ac is called the “discriminant” of the equation and gives either two 
solutions, one solution or no solutions according as its value is positive, zero or negative. 

The single solution case is usually interpreted as a pair of coincident solutions while the no 
solution case really means no real solutions. A more complete discussion of this case arises 
in the subject of “complex numbers” (see Unit 6.1). 
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EXAMPLES 


Use the quadratic formula to solve the following: 

1 . 


x 2 + 2x — 35 = 0. 


Solution 


2 . 


-2 ± V4 + 140 


-2 ± 12 
2 


5 or — 7. 


2x 2 — 3x — 7 = 0. 


Solution 


3. 


x = 


3 ± ^9 + 56 3 ± VQ5 3 ± 8.062 


~ 2.766 or 


1.266 


9a: 2 — 6a: + 1 = 0. 


Solution 


4. 


x = 


6 ± V36 - 36 
18 


6 

18 


1 

3 


only. 


5a: 2 + x + 1 = 0. 


Solution 


x = 


Hence, there are no real solutions 


-1 ± VI - 20 
10 
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1.6.4 EXERCISES 


Make the given symbol the subject of the following formulae 

(a) x : a(x — a) = b(x + 6); 

(b) b : a = 2 ~ lb - 


3+5 6’ 

( c ) r: n = 2 Z\/F ; 


(d) x: ye 


x 2 +l _ 


= 5. 


2. Solve, for x, the following equations 

(a) 14x = 35; 

(b) 3x — 4.7 = 2.8; 

(c) 4(2x — 5) = 3(2x + 8). 


3. Solve the following quadratic equations by factorisation: 

(a) x 2 + 5x — 14 = 0; 

(b) 8x 2 + 2x — 3 = 0. 


4. Where possible, solve the following quadratic equations by the formula: 
(a) 2x 2 — 3a: + 1 = 0; (b) 4x = 45 — x 2 ; 

(c) 16x 2 - 24x + 9 = 0; (d) 3x 2 + 2x + 11 = 0. 


1.6.5 ANSWERS TO EXERCISES 


1. (a) X = 

(b) b = 


b 2 +a 2 , 
a-b ’ 
2— 3a . 


7+5a ’ 

(c) r = 4 n 2 L 2 p; 

(d) x = ±Vln5 — In y — 1. 


2. (a) 2.5; (b) 2.5; (c) 22. 


3. (a) 

x = 

-7, x = 2; 

(b) 

x = 

3 „. _ 1 
4’ x 2 

4. (a) 

X = 

1, X 2 ’ 

(b) 

X = 

5, x = -9: 

(c) 

X = 

| only; 

(d) 

No : 

solutions. 
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UNIT 1.7 - ALGEBRA 7 - SIMULTANEOUS LINEAR EQUATIONS 
Introduction 

When Mathematics is applied to scientific work, it is often necessary to consider several 
statements involving several variables which are required to have a common solution for 
those variables. We illustrate here the case of two simultaneous linear equations in two 
variables x and y and three simultaneous linear equations in three unknowns x, y and z. 

1.7.1 TWO SIMULTANEOUS LINEAR EQUATIONS IN TWO UNKNOWNS 

Consider the simultaneous linear equations: 


ax + by = p, 
cx + dy = q. 

To obtain the solution we first eliminate one of the variables in order to calculate the other. 
For instance, to eliminate x, we try to make coefficient of x the same in both equations so 
that, subtracting one statement from the other, x disappears. In this case, we multiply the 
first equation by c and the second equation by a to give 


cax + cby = cp, 
acx + ady = aq. 

Subtracting the second equation from the first, we obtain 

y(cb — ad) = cp — aq 

which, in turn, means that 

y = — provided cb — ad ^ 0. 

cb — ad 

Having found the value of y, we could then either substitute back into one of the original 
equations to find x or begin again by eliminating y. 

However, it is better not to think of the above explanation as providing a formula for solving 
two simultaneous linear equations. Rather, a numerical example should be dealt with from 
first principles with the numbers provided. 
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Note: 

cb — ad = 0 relates to a degenerate case in which the left hand sides of the two equations are 
proportional to each other. Such cases will not be dealt with at this stage. 

EXAMPLE 

Solve the simultaneous linear equations 

6x-2y = 1, (1) 

4a; + 7 y = 9. (2) 


Solution 

Multiplying the first equation by 4 and the second equation by 6, 


24a; — 8 y = 4, (4) 

24a; + 42 y = 54. (5) 


Subtracting the second of these from the first, we obtain —50 y = —50 and hence y — 1. 
Substituting back into equation (1), 6a; — 2 = 1, giving 6a; = 3, and, hence, x — 

Alternative Method 

Multiplying the first equation by 7 and the second equation by —2, we obtain 

42z-14 y = 7, (5) 

-8x-Uy = -18. (6) 

Subtracting equation (6) from equation (5) gives 50a; = 25 and hence, x = 

Substituting into equation (1) gives 3 — 2y = 1 and hence, —2 y = —2; that is y — 1. 
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1.7.2 THREE SIMULTANEOUS LINEAR EQUATIONS IN THREE UNKNOWNS 

Here, we consider three simultaneous equations of the general form 

a\x + biy + c\z = ki, 
a 2 x + b 2 y + c 2 z = k 2 , 
a 3 x + b 3 y + c 3 z = k 3 ; 

but the method will be illustrated by a particular example. 

The object of the method is to eliminate one of the variables from two different pairs of 
the three equations so that we are left with a pair of simultaneous equations from which to 
calculate the other two variables. 

EXAMPLE 

Solve, for x, y and z, the simultaneous linear equations 

x — y + 2z = 9, (1) 

2x + y — z = l, (2) 

3a; — 2 y + z = 8. (3) 

Solution 

Firstly, we may eliminate z from equations (2) and (3) by adding them together. We obtain 

5a; — y = 9. (4) 

Secondly, we may eliminate z from equations (1) and (2) by doubling equation (2) and adding 
it to equation (1). We obtain 

5 x + y = 11. (5) 

If we now add equation (4) to equation (5), y will be eliminated to give 
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1 Ox = 20 or x — 2. 


Similarly, if we subtract equation (4) from equation (5), x will be eliminated to give 


2y = 2 or y = 1. 

Finally, if we substitute our values of x and y into one of the original equations [say equation 
(3)] we obtain 


z = 8 — 3a; + 2y = 8 — 6 + 2 = 4. 


Thus, 


x = 2, y = 1 and z — 4. 

1.7.3 ILL-CONDITIONED EQUATIONS 

In the simultaneous linear equations of genuine scientific problems, the coefficients will often 
be decimal quantitities that have already been subjected to rounding errors; and the solving 
process will tend to amplify these errors. The result may be that such errors swamp the 
values of the variables being solved for; and we have what is called an “ill-conditioned” 
set of equations. The opposite of this is a “well-conditioned” set of equations and all of 
those so far discussed have been well-conditioned. But let us consider, now, the following 
example: 

EXAMPLE 

The simultaneous linear equations 


x + y = 1, 
1.001 x + y = 2 


have the common solution x = 1000, y = —999. 
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However, suppose that the coefficient of x in the second equation is altered to 1.000, which 
is a mere 0.1%. Then the equations have no solution at all since x + y cannot be equal to 1 
and 2 at the same time. 

Secondly, suppose that the coefficient of x in the second equation is altered to 0.999 which 
is still only a 0.2% alteration. 

The solutions obtained are now x = —1000, y = 1001 and so a change of about 200% has 
occurred in original values of x and y. 

1.7.4 EXERCISES 

1. Solve, for x and y, the following pairs of simultaneous linear equations: 

(a) 

x — 2 y = 5, 

3 x + y = 1; 

(b) 

2x + 3y = 42, 

5 x — y = 20. 


2. Solve, for x, y and z, the following sets of simultaneous equations: 

(a) 

x + y + z = 0, 

2x — y — 3z = 4, 

3x + 3y = 7; 

(b) 

x + y — 10 = 0, 
y + z — 3 = 0, 
x + z + 1 = 0; 

(c) 

2x — y — z = 6, 
x + 3y + 2z = 1, 

3x — y — 5z = 1; 
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2x — 5y + 2z = 14, 
9a; + 3y — 4z = 13, 
7x + 3y — 2z = 3; 

4x — 7y + 6z = —18, 
5x + y — 4z = —9, 
3x — 2y + 3z = 12. 


3. Solve the simultaneous linear equations 


1.985a; — 1.358?/ = 2.212, 
0.953a; - 0.652a/ = 1.062, 


and compare with the solutions obtained by changing the constant term, 1.062, of the 
second equation to 1.061. 


1.7.5 ANSWERS TO EXERCISES 

1. (a) x — 1, y — —2; (b) x — 6, y — 10. 

2. (a) x = -§, y = f , z = 

(b) x = 3, y = 7, z = -4; 

(c) x = 3, y = -2, z = 2; 

(d) a; = 1, y = -4, z = -4; 

(e) a; = 3, y = 12, z — 9. 

3. 

a; ~ 0.6087, y ~ —0.7391 

compared with 


a; ~ 30.1304, y ~ 42.413 
a change of 4850% in x and 5839% in y. 
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UNIT 1.8 - ALGEBRA 8 - POLYNOMIALS 


Introduction 

The work already covered in earlier units has frequently been concerned with mathemati- 
cal expressions involving constant quantities together positive integer powers of a variable 
quantity (usually x). The general form of such expressions is 

Gq Qj\X T T CL^X^ T T Qj n X n 


which is called a 

“polynomial of degree n in x” , having “coefficients” 00 , 01 , 02 , 03 , a n , usually con- 

stant. 

Note: 

Polynomials of degree 1, 2 and 3 are called respectively “linear”, “quadratic” and “cubic” 
polynomials. 

1.8.1 THE FACTOR THEOREM 

If P(x) denotes an algebraic polynomial which has the value zero when x = a, then x — a is 
a factor of the polynomial and 

P(x ) = (x — o)x another polynomial, Q(x), of one degree lower. 

Notes: 

(i) The statement of this Theorem includes some functional notation (i.e. P(x), Q(x )) which 
will be discussed fully as an introduction to the subject of Calculus in Unit 10.1. 

(ii) x = a is called a “root” of the polynomial. 

1.8.2 APPLICATION TO QUADRATIC AND CUBIC EXPRESSIONS 
(a) Quadratic Expressions 

Suppose we are given a quadratic expression where we suspect that at least one of the values 
of x making it zero is a whole number. We may systematically try 

x = 0) 1) — 1) 2, —2, 3, —3, 4, —4, 

until such a value of x (say x = a) is located. Then one of the factors of the quadratic 
expression is x — a enabling us to determine the other factor(s) easily. 
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EXAMPLES 


1. By trial and error, the quadratic expression 

x 2 + 2x — 3 

has the value zero when x — 1; hence, x — 1 is a factor. 

By further trial and error, the complete factorisation is 

(x — 1 ) (.r — 3). 

2. By trial and error, the quadratic expression 

3a: 2 + 20a: — 7 

has the value zero when x = —7; hence, (x + 7) is a factor. 
By further trial and error, the complete factorisation is 

(x + 7) (3a:- 1). 


(b) Cubic Expressions 

The method just used for the factorisation of quadratic expressions may also be used for 
polynomials having powers of x higher than two. In particular, it may be used for cubic 
expressions whose standard form is 

ax 3 + bx 2 + cx + d, 


where a, b , c and d are constants. 

EXAMPLES 

1. Factorise the cubic expression 

x 3 + 3a: 2 — x — 3 

assuming that there is at least one whole number value of x for which the expression 
has the value zero. 

Solution 

By trial and error, the cubic expression has the value zero when x — 1. Hence, by the 
Factor Theorem, (a: — 1) is a factor. 

Thus, 

x 3 + 3a: 2 — x — 3 = (x — 1 )Q(x) 
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where Q(x) is some quadratic expression to be found and, if possible, factorised further. 
In other words, 

a; 3 + 3a; 2 — x — 3 = (x — 1) ( px 2 + qx + r) 
for some constants p, q and r. 

Usually, it is fairly easy to determine Q(x) using trial and error by comparing, initially, 
the highest and lowest powers of x on both sides of the above identity; then comparing 
any intermediate powers as necessary. We obtain 

x 3 + 3a; 2 — x — 3 = (x — l)(a; 2 + 4a; + 3). 

In this example, the quadratic expression does factorise even further giving 

x 3 + 3a; 2 — x — 3 = (x — l)(x + l){x + 3). 

2. Factorise the cubic expression 

x 3 + Ax 2 + Ax + 1. 


Solution 

By trial and error, we discover that the cubic expression has value zero when x — — 1, 
and so x + 1 must be a factor. 

Hence, 

x 3 + 4a; 2 + 4a; + 1 = (x + 1 )Q(x), 
where Q(x) is a quadratic expression to be found. In other words, 

x 3 + 4a; 2 + 4a; + 1 = (x + 1) ( px 2 + qx + r) 

for some constants p, q and r. 

Comparing the relevant coefficients, we obtain 

x 3 + 4a; 2 + 4a; + 1 = (x + l)(a; 2 + 3a; + 1) 

but, this time, the quadratic part of the answer will not conveniently factorise further. 

1.8.3 CUBIC EQUATIONS 

There is no convenient general method of solving a cubic equation; hence, the discussion 
here will be limited to those equations where at least one of the solutions is known to be a 
fairly small whole number, that solution being obtainable by trial and error. 

We illustrate with examples. 
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EXAMPLES 


1. Solve the cubic equation 


x 3 + 3a; 2 — x — 3 = 0. 


Solution 

By trial and error, one solution is x — 1 and so {x — 1) must be a factor of the left hand 
side. In fact, we obtain the new form of the equation as 

(x — l)(x 2 + 4a; + 3) = 0. 


That is, 


(x — l)(a; + l)(a; + 3) = 0. 


Hence, the solutions are x — 1, x — — 1 and x = —3. 


2. Solve the cubic equation 

2a; 3 — 7x 2 + 5a; + 54 = 0. 


Solution 

By trial and error, one solution is x = —2 and so {x + 2) must be a factor of the left 
hand side. In fact we obtain the new form of the equation as 

(x + 2) (2a; 2 - 11a; + 27) = 0. 

The quadratic factor will not conveniently factorise, but we can use the quadratic for- 
mula to determine the values of x, if any, which make it equal to zero. They are 

11 ± y/121^216 


The discriminant here is negative so that the only (real) solution to the cubic equation 
is x = —2. 

1.8.4 LONG DIVISION OF POLYNOMIALS 
(a) Exact Division 

Having used the Factor Theorem to find a linear factor of a polynomial expression, there 
will always be a remaining factor, Q(x), which is some other polynomial whose degree is one 
lower than that of the original; but the determination of Q(x) by trial and error is not the 
only method of doing so. An alternative method is to use the technique known as “long 
division of polynomials” ; and the working is illustrated by the following examples: 
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EXAMPLES 


1. Factorise the cubic expression 

x 3 + 3a; 2 — x — 3. 

Solution 

By trial and error, the cubic expression has the value zero when x = 1 so that {x — 1) 
is a factor. 

Dividing the given cubic expression (called the “dividend”) by {x — 1) (called the 
“divisor”), we have the following scheme; 

x 2 + 4a; + 3 
x — l)a; 3 + 3a; 2 — x — 3 
x° — X 

4a; 2 — x — 3 
4a; 2 — 4a; 

3a; — 3 
3a; — 3 
0 

Note: 

At each stage, using positive powers of x or constants only, we examine what the highest 
power of x in the divisor would have to be multiplied by in order to give the highest 
power of x in the dividend. The results are written in the top line as the “quotient” 
so that, on multiplying down, we can subtract to find each “remainder”. 

The process stops when, to continue would need negative powers of x; the final remain- 
der will be zero when the divisor is a factor of the original expression. 

We conclude here that 

x 3 + 3a; 2 — x — 3 = (x — l)(x 2 + 4a; + 3). 

Further factorisation leads to the complete result, 

x 3 + 3a; 2 — x — 3 = (a; — l)(x + l)(x + 3). 

2. Solve, completely, the cubic equation 

x 3 + 4a; 2 + 4a; + 1 = 0. 

Solution 

By trial and error, one solution is x = —1 so that (x + 1) is a factor of the left hand 
side. 
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Dividing the left hand side of the equation by (x + 1), we have the following scheme: 

x 2 + 3a; + 1 
x + l)a; 3 + 4a; 2 + 4x + 1 
a; 3 + x 2 

3x 2 + 4x + 1 
3a: 2 + 3a; 
x + 1 
x + 1 
0 

Hence, the equation becomes 

(x + l)(a; 2 + 3a; + 1) = 0, 
giving x — — 1 and x = ~ —0.382 or — 2.618 


(b) Non-exact Division 

The technique for long division of polynomials may be used to divide any polynomial by 
another polynomial of lower or equal degree, even when this second polynomial is not a 
factor of the first. 

The chief difference in method is that the remainder will not be zero; but otherwise we 
proceed as before. 

EXAMPLES 


1. Divide the polynomial 6a; + 5 by the polynomial 3a; — 1. 

Solution 


2 


3a; — l)6a; + 5 
6a; — 2 
7 


Hence, 


6a; + 5 _ o 7 
3a; — 1 = + 3a;- 1' 
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2. Divide 3a; 1 2 + 2x by x + 1. 

Solution 


Hence, 


3x- 1 

x + l)3a; 2 + 2x 
3x 2 + 3x 

—x 

—x — 1 
1 


3a; 2 + 2a; 1 

= 3a; — 1 H . 

x + 1 x + 1 


3. Divide x 4 + 2a; 3 — 2a; 2 + 4a; — 1 by x 2 + 2a; — 3. 

Solution 


x 2 + 1 

x 2 + 2x — 3)x 4 + 2a; 3 — 2a; 2 + 4a; — 1 
x 4 + 2a; 3 — 3a; 2 

x 2 + 4x — 1 
x 2 + 2x — 3 
2x + 2 


Hence, 

x 4 + 2a; 3 — 2a; 2 + 4a; — 1 _ 2 2a; + 2 

x 2 + 2a; — 3 X x 2 + 2a; — 3 ’ 


1.8.5 EXERCISES 

1. Use the Factor Theorem to factorise the following quadratic expressions: assuming that 
at least one root is an integer: 

(a) 2x 2 + 7x + 3; 

(b) 1 — 2a; — 3a; 2 . 

2. Factorise the following cubic polynomials assuming that at least one root is an integer: 

(a) x 3 + 2a; 2 — x — 2; 

(b) x 3 — x 2 — 4; 

(c) x 3 — 4x 2 + 4a; — 3. 
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3. Solve completely the following cubic equations assuming that at least one solution is an 
integer: 

(a) x 3 + 6a; 2 + 11a; + 6 = 0; 

(b) x 3 + 2x 2 — 31a; + 28 = 0. 

4. (a) Divide 2a; 3 — 11a; 2 + 18a; — 8 by x — 2; 

(b) Divide 3a; 3 + 12a; 2 + 13a; + 4 by x + 1. 

5. (a) Divide x 2 — 2x + 1 by x 2 + 3a; — 2, expressing your answer in the form of a constant 

plus a ratio of two polynomials. 

(b) Divide x 5 by x 2 — 2x — 8, expressing your answer in the form of a polynomial plus 
a ratio of two other polynomials. 

1.8.6 ANSWERS TO EXERCISES 


1. (a) (2a; + l)(a; + 3); (b) (x + 1)(1 — 3x). 

2. (a) ( x — l)(x + l)(a; + 2); (b) (x — 2)(x 2 + x + 2); (c) (x — 3)(a; 2 — x + 1). 

3. (a) x = —1, x = —2, x = —3; (b) x — 1, x — 4, x — —7. 

4. (a) 2x 2 — 7x + 4; (b) 3a; 2 + 9a; + 4. 

5. (a) 

3 — 5a; 

+ a; 2 + 3a; — 2 ’ 


(b) 


x 3 + 2x 2 + 12a; + 40 + 
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176a; + 320 
x 2 — 2x — 8 ’ 
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UNIT 1.9 - ALGEBRA 9 - THE THEORY OF PARTIAL FRACTIONS 


1.9.1 INTRODUCTION 

The theory of partial fractions applies chiefly to the ratio of two polynomials in which the 
degree of the numerator is strictly less than that of the denominator. Such a ratio is called 

a “proper rational function”. 

For a rational function which is not proper, it is necessary first to use long division of 
polynomials in order to express it as the sum of a polynomial and a proper rational function. 

RESULT 

A proper rational function whose denominator has been factorised into its irreducible factors 
can be expressed as a sum of proper rational functions, called “partial fractions”; the 
denominators of the partial fractions are the irreducible factors of the denominator in the 
original rational function. 

ILLUSTRATION 

From previous work on fractions, it can be verified that 

1 3 7x + 8 

2x + 3 x — 1 (2x + 3)(x — 1) 

and the expression on the left hand side may be interpretted as the decomposition into 
partial fractions of the expression on the right hand side. 

1.9.2 STANDARD TYPES OF PARTIAL FRACTION PROBLEM 
(a) Denominator of the given rational function has all linear factors. 

EXAMPLE 

Express the rational function 

7x + 8 

(2x + 3)(Y — 1) 

in partial fractions. 

Solution 

There will be two partial fractions each of whose numerator must be of lower degree than 1; 
i.e. it must be a constant 

We write 

7x + 8 _ A B 

(2x + 3) (x — 1) 2x + 3 x — 1 ’ 
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Multiplying throughout by (2x + 3)(x — 1), we obtain 


7x + 8 = A(x - 1) + B{2x + 3). 

In order to determine A and B , any two suitable values of x may be substituted on both 
sides; and the most obvious values in this case are x = 1 and x = — |. 

It may, however, be argued that these two values of x must be disallowed since they cause 
denominators in the first identity above to take the value zero. 

Nevertheless, we shall use these values in the second identity above since the arithmetic 
involved is negligibly different from taking values infinitesimally close to x — 1 and x — — |. 

Substituting x = 1 gives 
Hence, 


7 + 8 = 5(2 + 3). 


fl = I±» = “ =s . 

2 + 3 5 


Substituting x — — | gives 


Hence, 


7x+ + 8 = V-5-1). 


,4= 7x -i + 


2 - 1 
2 - 1 - 


5 

1 = 1 - 
2 


We conclude that 


7x + 8 


(2x + 3) (x — 1) 2x + 3 x — 1 


The “Cover-up” Rule 

A useful time-saver when the factors in the denominator of the given rational function are 
linear is to use the following routine which is equivalent to the method described above: 

To obtain the constant numerator of the partial fraction for a particular linear factor, ax + b, 
in the original denominator, cover up ax + b in the original rational function and then 
substitute x — — - into what remains. 

a 

ILLUSTRATION 

In the above example, we may simply cover up x — 1, then substitute x — 1 into the fraction 

7x + 8 
2x + 3’ 
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Then we may cover up 2x + 3 and substitute x — — § into the fraction 

7x + 8 

x — 1 


Note: 

We shall see later how the cover-up rule can also be brought into effective use when not all 
of the factors in the denominator of the given rational function are linear. 

(b) Denominator of the given rational function contains one linear and one 
quadratic factor 

EXAMPLE 

Express the rational function 

3a; 2 + 9 

(x - 5)(x 2 + 2x + 7) 

in partial fractions. 

Solution 

We should observe firstly that the quadratic factor will not reduce conveniently into two 
linear factors. If it did, the method would be as in the previous paragraph. Hence we may 
write 


3a: 2 + 9 _ A Bx + C 

(a;-5)(a; 2 + 2a; + 7) = x - 5 + x^T^xT?’ 

noticing that the second partial fraction may contain an x term in its numerator, yet still 
be a proper rational function. 

Multiplying throughout by (x — 5)(x 2 + 2x + 7), we obtain 

3a; 2 + 9 = A(x 2 + 2a; + 7) + (Bx + C)(x — 5). 

A convenient value of x to substitute on both sides is x = 5 which gives 

3 x 5 2 + 9 = A(5 2 + 2x5 + 7). 

That is, 84 = 42A or A = 2. 

No other convenient values of x may be substituted; but two polynomial expressions can be 
identical only if their corresponding coefficients are the same in value. We therefore equate 
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suitable coefficients to find B and C\ usually, the coefficients of the highest and lowest powers 
of x. 

Equating coefficients of a; 2 , 3 = A + B and hence B — 1. 

Equating constant terms (the coefficients of re 0 ), 9 = 7 A — 5 C = 14 — 5C and hence C — 1. 
The result is therefore 


3a; 2 + 9 _ 2 x + 1 

(x - 5)(x 2 + 2x + 7) = a; — 5 + x 2 + 2x + 7' 


Observations 

It is easily verified that the value of A may be calculated by means of the cover-up rule, 
as in paragraph (a); and, having found A, the values of B and C could be found by cross 
multiplying the numerators and denominators in the expression 

2 7x+? 

x — 5 x 2 + 2x + 7 

in order to arrive at the numerator of the original rational function. This process essentially 
compares the coefficients of x 2 and a; 0 as before. 

(c) Denominator of the given rational function contains a repeated linear factor 

In general, examples of this kind will not be more complicated than for a rational func- 
tion with one repeated linear factor together with either a non-repeated linear factor or a 
quadratic factor. 

EXAMPLE 

Express the rational function 

9 

(x + l) 2 (a; — 2) 

in partial fractions. 

Solution 

First we observe that, from paragraph (b), the partial fraction corresponding to the repeated 
linear factor would be of the form 

Ax + B 
(x + l) 2 ’ 
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but this may be written 

A(x + 1) + B-A_ A B-A A C 

(x + l) 2 X + 1 (x + l) 2 X + 1 (x + l) 2 ’ 

Thus, a better form of statement for the problem as a whole, is 

9 A C D 

(x + l) 2 (x — 2) x + 1 (x + l) 2 x — 2 ’ 

Eliminating fractions, we obtain 

9 = A(x + l)(x — 2) + C(x — 2) + D(x + l) 2 . 

Putting x — — 1 gives 9 = —3 C so that C = —3. 

Putting x = 2 gives 9 = 9 D so that D — 1. 

Equating coefficients of a: 2 gives 0 = A + D so that A = — 1. 

Therefore, 

9 _ 1 3 1 

(x + l) 2 (x — 2) x + 1 (x + l) 2 x — 2 ’ 

Notes: 

(i) Similar partial fractions may be developed for higher repeated powers so that, for a 
repeated linear factor of power of n, there will be n corresponding partial fractions, each 
with a constant numerator. The labels for these numerators in future will be taken as A, B, 
C, etc. in sequence. 

(ii) We observe that the numerator above the repeated factor itself (D in this case) could 
actually have been obtained by the cover-up rule; covering np (x + l) 2 in the original rational 
function, then substituting x — — 1 into the rest. 

(d) Keily’s Method 

A useful method for repeated linear factors is to use these factors one at a time, keeping the 
rest outside the expression as a factor. 

EXAMPLE 

Express the rational function 



(x + l) 2 (a: — 2) x + 1 [(x + l)(x — 2)J 
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in partial fractions. 

Solution 

Using the cover-up rule inside the square brackets, 


9 

(x + l) 2 (x — 2) 


1 [ -3 + 3 ’ 
x + 1 Lrr + 1 x — 2j 


3 3 
(x + l) 2 (x + l){x — 2) ’ 


and, again by cover-up rule, 


3 _ 1 1 

(x + l) 2 x + 1 x — 2 

as before. 


Warning 

Care must be taken with Kcily’s method when, even though the original rational function 
is proper, the resulting expression inside the square brackets is improper. This would have 
occurred, for instance, if the problem given had been 

9a; 2 

(x + l) 2 (a; — 2) ’ 


leading to 


1 9a; 2 

x + 1 (a; + l)(a; — 2) 


In this case, long division would have to be used inside the square brackets before proceeding 
with Kcily’s method. 

For such examples, it is probably better to use the method of paragraph (c). 

1.9.3 EXERCISES 

Express the following rational functions in partial fractions: 


1 . 


3a; + 5 

(x + l)(x 4- 2) ' 
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2. 

17x + 11 

(x — l)(.r — 2){x + 3) ’ 

3. 

3a; 2 -8 

(x — l)(x 2 + x — 7) ' 

4. 

2x + 1 

(x + 2) 2 (x — 3) ' 

5. 

9 + llx — x 2 
(x + l) 2 (a; + 2) ' 

6 . 

x 5 

(x + 2){x — 4) ’ 

1.9.4 ANSWERS TO EXERCISES 


1 . 

2 . 

3. 

4. 

5. 

6 . 


2 1 
x + 1 x + 2 

13 4 

a; + 1 a; — 2 a; + 3’ 

1 2a; + 1 

a; — 1 x 2 + x — 7 

3 7 7 

5(x + 2) 2 _ 25(a; + 2) + 25(a;-3)‘ 

3 16 17 

(x + l) 2 + x + 1 x + 2 


x 3 + 2x 2 + 12a; + 40 H — 7 + 

3 (a; + 2) 


512 

3 (a; — 4) ' 
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UNIT 1.10 - ALGEBRA 10 - INEQUALITIES 1. 

1.10.1 INTRODUCTION 

If the symbols a and b denote numerical quantities, then the statement 

a < b 

is used to mean “a is less than b” while the statement 

b > a 


is used to mean “b is greater than a” 

These are called “strict inequalities” because there is no allowance for the possibilty that 
a and b might be equal to each other. For example, if a is the number of days in a particular 
month and b is the number of hours in that month, then b > a. 

Some inequalities do allow the possibility of a and b being equal to each other and are called 
“weak inequalities” written in one of the forms 

a < b b < a a > b b > a 


For example, if a is the number of students who enrolled for a particular module in a 
university and b is the number of students who eventually passed that module, then a > b. 

1.10.2 ALGEBRAIC RULES FOR INEQUALITIES 

Given two different numbers, one of them must be strictly less than the other. Suppose a is 
the smaller of the two and b the larger; i.e. 


a < b 


Then 

1. a + c < b + c for any number c. 

2. ac < be when c is positive but ac > be when c is negative. 

3. - > | provided a and b are both positive. 

Note: 

The only other situation in 3. which is consistent with a < b will occur if a is negative and 
b is positive. In this case, ^ | because a negative number is always less than a positive 

number. 
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EXAMPLES 


1. Simplify the inequality 


2x + 3y > 5x — y + 7. 


Solution 

We simply deal with this in the same way as we would deal with an equation by adding 
appropriate quantities to both sides or subtracting appropriate quantities from both 
sides. We obtain 

—3a; + Ay > 7 or 3a; — Ay + 7 < 0. 


2. Solve the inequality 



< 2 , 


assuming that x ^ 1. 

Solution 

Here we must be careful in case x — 1 is negative; the argument is therefore in two parts: 
(a) If x > 1, i.e. x — 1 is positive, then the inequality can be rewritten as 


1 < 2 (a; 


1) or x — 1 > 


Hence, 


x > 


3 

2 ' 


(b) If a; < 1, i.e. x — 1 is negative, then the inequality is automatically true since a 
negative number is bound to be less than a positive number. 

Conclusion: 

The given inequality is satisfied when x < 1 and when x > |. 

3. Solve the inequality 

2a; - 7 < 3. 


Solution 

Adding 7 to both sides, then dividing both sides by 2 gives 

x < 5. 


4. Solve the inequality 


x — 1 
x — 6 


> 0 . 
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Solution 


We first observe that the fraction on the left of the inequality can equal zero only when 
x — 1. 

Secondly, the only way in which a fraction can be positive is for both numerator and 
denominator to be positive or both numerator and denominator to be negative. 

(a) Suppose x — 1 > 0 and x — 6 > 0; these two are covered by x > 6. 

(b) Suppose x — 1 < 0 and x — 6 < 0; these two are covered by x < 1. 

Note: 

The value x = 6 is problematic because the given expression becomes infinite; in fact, 
as x passes through 6 from values below it to values above it, there is a sudden change 
from — oo to +oo. 

Conclusion: 

The inequality is satisfied when either x > 6 or x < 1. 

1.10.3 INTERVALS 

In scientific calculations, a variable quantity x may be restricted to a certain range of values 
called an “interval” which may extend to oo or — oo; but, in many cases, such intervals 
have an upper and a lower “bound”. The standard types of interval are as follows: 

(a) a < x < b denotes an “open interval” of all the values of x between a and b but 
excluding a and b themselves. The symbol (a, b) is also used to mean the same thing. For 
example, if a; is a purely decimal quantity, it must lie in the open interval 

-1 < x < 1. 


(b ) a < x < b denotes a “closed interval” of all the values of x from a to b inclusive. The 
symbol [a, b] is also used to mean the same thing. For example, the expression y/1 — x 2 has 
real values only when 

-1 < x < 1. 


Note: 

It is possible to encounter intervals which are closed at one end but open at the other; they 
may be called either “half open” or “half closed” . For example 

a < x < b or a < x < 6, 

which can also be denoted respectively by (a, b] and [a, b). 
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(c) Intervals of the types 


x > a x > a x < a x < a 

are called “infinite intervals”. 

1.10.4 EXERCISES 

1. Simplify the following inequalities: 

(a) 

x + y <2x + y + 1) 

(b) 

2a — b > 1 + a — 2b — c. 

2. Solve the following inequalities to find the range of values of x. 

(a) 

(b) 

(c) 

(d) 

3. Classify the following intervals 

(a) 

(b) 


x + 3 < 6; 


—2x > 10; 


2 > 18; 
x 


x + 3 


< 0 . 


2x - 1 

as open, closed, or half-open/half-closed: 


(5,8); 


(-3,-2); 
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[2,4); 


[8,23]; 
(— oo, oo) 
(0, oo); 
[0, oo). 

1.10.5 ANSWERS TO EXERCISES 

1. (a) x > —1; 

(b) a + b + c > 1. 

2. (a) x < 3; 

(b) x < —5; 

(c) x < | since x > 0; 

(d) -3 < x < 

3. (a) open; 

(b) open; 

(c) half-open/half-closed; 

(d) closed; 

(e) open; 

(f) open; 

(g) half-open/half-closed. 


(d) 

(e) 

(f) 

(g) 
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UNIT 1.11 - ALGEBRA 11 - INEQUALITIES 2. 

1.11.1 RECAP ON MODULUS, ABSOLUTE VALUE OR NUMERICAL VALUE 

As seen in Unit 1.1, the Modulus of a numerical quantity ignores any negative signs if there 
are any. For example, the modulus of —3 is 3, but the modulus of 3 is also 3. 

The modulus of an unspecified numerical quantity x is denoted by the symbol 

I x I 


and is defined by the two statements: 


| x \— x if x > 0; 
x 1= —x if x < 0. 


Notes: 

(i) An alternative, but less convenient formula for the modulus of x is 


x 


(ii) It is possible to show that, for any two numbers a and b, 

I a + b I < I a I + I b I . 


This is called the “triangle inequality” and can be linked to the fact that the length of 
any side of a triangle is never greater than the sum of the lengths of the other two sides. 

The proof is a little involved since it is necessary to consider all possible cases of a and b 
being positive, negative or zero together with a consideration of their relative sizes. It will 
not be included here. 

1.11.2 INTERVAL INEQUALITIES 

(a) Using the Modulus notation 

In this section, we investigate the meaning of the inequality 

| x — a |< k, 

where a is any number and k is a positive number. 
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Case 1. x — a > 0. 


The inequality can be rewritten as 

x — a < k i.e. x < a + k. 


Case 2. x — a < 0. 

The inequality can be rewritten as 

—(x — a) < k i.e. a — x < k i.e. x > a — k. 

Combining the two cases, we conclude that 

| x — a | < k means a — k < x < a + k, 

which is an open interval having x = a at the centre and extending to a distance of a either 
side of the centre. A similar interpretation could be given of | x — a \< k. 

EXAMPLE 

Obtain the closed interval represented by the statement 

I x + 3 |< 10. 


Solution 

Using a = —3 and k = 10, we have 

-3 - 10 < x < -3 + 10. 


That is, 

-13 < x < 7. 


(b) Using Factorised Polynomials 

Suppose a polynomial in x has been factorised into a number of linear factors corresponding 
to the degree of the polynomial. Then, if certain values of x are substituted in, the polynomial 
will be positive (or zero) as long as the number of individual factors which become negative is 
even. Similarly, the polynomial will be negative (or zero) as long as the number of individual 
factors which become negative is odd. 

These observations enable us to find the ranges of values of x for which a factorised polyno- 
mial is positive or negative. 
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EXAMPLE 


Find the range of values of x for which the polynomial 

(x + 3)(x — l)(x — 2) 

is strictly positive. 

Solution 

The first task is to hnd what are called the “critical values” . These are the values of x at 
which the polynomial becomes equal to zero. In our case, the critical values are x = —3, 1, 2. 

Next, the critical values divide the x-line into separate intervals where, for the moment, we 
exclude the critical values themselves. In this case, we obtain 

x < —3, — 3 < x < 1, 1 < x < 2, x > 2. 


All that is now necessary is to select a value from each of these intervals and investigate how 
it affects the signs of the factors of the polynomial and hence the sign of the polynomial 
itself. 

x < — 3 gives (neg) (neg) (neg) and therefore < 0; 

— 3 < x < 1 gives (pos) (neg) (neg) and therefore > 0; 

1 < x < 2 gives (pos) (pos) (neg) and therefore < 0; 
x > 2 gives (pos) (pos) (pos) and therefore > 0. 

Clearly the critical values will not be included in the answer for this example because they 
cause the polynomial to have the value zero. 

The required ranges are thus 


— 3 < x < 1 and x > 2. 


Note: 

An alternative method is to sketch the graph of the polynomial as a smooth curve passing 
through all the critical values on the x-axis. One point only between each critical value will 
make it clear whether the graph is on the positive side of the x-axis or the negative side. For 
the above example, the graph is as follows: 
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1.11.3 EXERCISES 

1. Determine the precise ranges of values of x which satisfy the following inequalities: 

(a) i x |< 2; 

(b) | x |> 3; 

(c) | x — 3 |< 1; 

(d) j x + 6 |< 4; 

(e) j x + 1 |> 2; 

(f) 0 <| x + 3 |< 5. 

2. Determine the precise ranges of values of x which satisfy the following inequalities: 

(a) (x — 4) (a; + 2) > 0; 

(b) (x + 3)(x — 2)(x — 4) < 0; 

(c) (x + l) 2 (a; — 3) > 0; 

(d) 18a; — 3a; 2 > 0. 

3. By considering the expansion of (a 2 — 6 2 ) 2 , show that 

a 4 + 6 4 > 2 a 2 b 2 . 
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1.11.4 ANSWERS TO EXERCISES 


(a) 

— 

2 ■ 

< x < 2; 



(b) 

X 

< 

—3 and 

x > 3; 


(c) 

2 

< 

x < 4; 



(d) 

— 

10 

< x < - 

-2; 


(e) 

X 

> 

1 and x 

<-3; 


(f) 

— 

■8 ■ 

< x < — 

3 and 

-3 < 

(a) 

X 

< 

—2 and 

x > 4; 


(b) 

X 

< 

—3 and 2 < x 

<4; 

(c) 

X 

> 

3; 



(d) 

0 

< 

x < 6. 




x <2. 
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UNIT 2.1 - SERIES 1 - ELEMENTARY PROGRESSIONS AND SERIES 
2.1.1 ARITHMETIC PROGRESSIONS 

The “sequence” of numbers, 

o, a T d, cl -|- 2d, a -t- 3 d } ... 
is said to form an “arithmetic progression” . 

The symbol a represents the “first term” , the symbol d represents the “common differ- 
ence” and the “n-th term” is given by the expression 

a + (n — 1 )d 


EXAMPLES 


1. Determine the n-th term of the arithmetic progression 15, 12, 9, 6 ,... 

Solution 

The n-th term is 

15 + (n — 1)(— 3) = 18 -3n. 

2. Determine the n-th term of the arithmetic progression 

8,8.125,8.25,8.375,8.5,... 


Solution 

The n-th term is 

8 + (n - 1) (0.125) = 7.875 + 0.125n. 

3. The 13th term of an arithmetic progression is 10 and the 25th term is 20; calculate 

(a) the common difference; 

(b) the first term; 

(c) the 17th term. 

Solution 

Letting a be the first term and d. be the common difference, we have 

a + 12 d = 10 


and 


a + 24 d = 20. 


(a) Subtracting the first of these from the second gives 12 d = 10, so that the common 
difference d — || = | ~ 0.83 
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(b) Substituting into the first of the relationships between a and d gives 


That is, 


5 

a + 12 x - = 10. 

6 


a + 10 = 10. 


Hence, a = 0. 

(c) The 17th term is 


0 + 16 x ^ 
6 


80 

~ 6 ~ 


40 

y 


13.3 


2.1.2 ARITHMETIC SERIES 

If the terms of an arithmetic progression are added together, we obtain what is called an 
“arithmetic series” . The total sum of the first n terms of such a series can be denoted by 
S n so that 

S n — a T [a T d] T [a T 2d] T ... T [a T ( n — 2 )d] T [a T (n — l)d]j 

but this is not the most practical way of evaluating the sum of the n terms, especially when 
n is a very large number. 

A trick which provides us with a more convenient formula for S n is to write down the existing 
formula backwards. That is, 

S n = [a T (tl — l)d] T [a T (n — 2)d] T 1 [fl T 2d] T [a T d] T cl. 


Adding the two statements now gives 

2 S n = [2a + (n — l)d] + [2a + (n — l)d] + ... + [2a + (n — l)d] + [2a + (n — l)d], 
where, on the right hand side, there are n repetitions of the same expression. 

Hence, 

2 S n = n[2a + (n — l)d] 
or 

Tl 

S n = -[2a + (n - 1 )d]. 


This version of the formula is suitable if we know the values of a, n and d; but an alternative 
version can be used if we know only the first term, the last term and the number of terms. 
In this case, 


77 

S n = -[FIRST + LAST] 
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which is simply n times the average of the first and last terms. 

EXAMPLES 


1. Determine the sum of the natural numbers from 1 to 100. 

Solution 

The sum is given by 

^ x [1 + 100] = 5050. 

2. How many terms of the arithmetic series 

10 + 12 + 14 + ... 

must be taken so that the sum of the series is 252 ? 

Solution 

The first term is clearly 10 and the common difference is 2. 

Hence, letting n be the number of terms, we require that 

Tl 

252 = -[20 + (n- 1) x 2], 

That is, 

Tl 

252 = -[2n+ 18] = n{n + 9). 

By trial and error, n = 12 will balance this equation; but it is more conclusive to obtain 
n as the solution to the quadratic equation 

n 2 + 9n — 252 = 0 

or 

(n — 12) (n + 21) = 0, 

which gives n = 12 only, since the negative value n = —21 may be ignored. 

3. A contractor agrees to sink a well 250 metres deep at a cost of £2.70 for the first metre, 
£2.85 for the second metre and an extra 15p for each additional metre. Find the cost 
of the last metre and the total cost. 

Solution 

In this problem we are dealing with an arithmetic series of 250 terms whose first term 
is 2.70 and whose common difference is 0.15. The cost of the last metre is the 250-th 
term of the series and therefore 

£[2.70 + 249 x 0.15] = £40.05 

The total cost will be 

£^p x [2.70 + 40.05] = £5343.75 
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2.1.3 GEOMETRIC PROGRESSIONS 


The sequence of numbers 


a, ar, ar 2 , ar 3 , ... 


is said to form a “geometric progression 1 ' . 


The symbol a represents the “first term” , the symbol r represents the “common ratio” 
and the “n-th term” is given by the expression 


EXAMPLES 


1. Determine the n-th term of the geometric progression 

3,-12,48,-192,... 

Solution 

The progression has n-th term 

3(— 4) n_1 

which will always be positive when n is an odd number and negative when n is an even 
number. 

2. Determine the seventh term of the geometric progression 

3,6,12,24,... 


Solution 

The seventh term is 


3(2 6 ) = 192. 


3. The third term of a geometric progression is 4.5 and the ninth term is 16.2. Determine 
the common ratio. 

Solution 

Firstly, we have 

ar 2 = 4.5 


and 


ar s = 16.2 


Dividing the second of these by the first gives 

ar 8 16.2 
ar 2 4.5 
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Therefore, 


and so 


r 6 = 3.6 
r ~ 1.238 


4. The expenses of a company are £200,000 a year. It is decided that each year they shall 
be reduced by 5% of those for the preceding year. 

What will be the expenses during the fourth year, the first reduction taking place at 
the end of the first year ? 

Solution 

In this problem, we use a geometric progression with first term 200,000 and common 
ratio 0.95. 

The expenses during the fourth year will thus be the fourth term of the progression; 
that is, £200,000 x(0.95) 3 = £171475. 


2.1.4 GEOMETRIC SERIES 

If the terms of a geometric progression are added together, we obtain what is called a 
“geometric series” . The total sum of a geometric series with n terms may be denoted by 
S n so that 

S n = a + ar + ar 2 + ar 3 + ... + ar" -1 

but, as with arithmetic series, this is not the most practical formula for evaluating S n . 

This time, a trick to establish a convenient formula for S n is to write down both S n and rS n , 
the latter giving 

rS n = ar + ar 2 + ar 3 + ar 4 + ... + ar n ~ 4 + ar 71 . 


Subtracting the second formula from the first gives 

S n — rS n = a — ar n , 

so that 

O «(1 - r n ) 

^ n 

1 — r 

This is the version of the formula most commonly used since, in many practical applications, 
r will be less than one; but, for examples in which r is greater than one, it may be better to 
use the alternative version, namely 

c _ a (r n ~ 1) 

n ~> 

r — 1 
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In fact, either version may be used whatever the value of r is. 

EXAMPLES 


1. Determine the sum of the geometric series 


Solution 

The sum is given by 






S 6 = 


4 ( 1 - (|) 6 ) 

i i 


4(1-0.0156) 

045 


~ 7.875 


2. A sum of money £C is invested for n years at an interest of 100r%, compounded 
annually. What will be the total interest earned by the end of the n-th year ? 

Solution 

At the end of year 1, the interest earned will be Cr . 

At the end of year 2, the interest earned will be (C + Cr)r = Cr( 1 + r). 

At the end of year 3, the interest earned will be C(1 + r)r + C( 1 + r)r 2 = Cr( 1 + r) 2 . 

This pattern reveals that, 

at the end of year n, the interest earned will be Cr ( 1 + r) n_1 . 

Thus the total interest earned by the end of year n will be 

Cr + Cr( 1 + r) + Cr( 1 + r) 2 + ... + Cr( 1 + r) n_1 , 

which is a geometric series of n terms with first term Cr and common ratio 1 + r. Its 
sum is therefore 


Cr((l T r) n - 1) 
r 


C((l+r) n -l). 


Note: 

The same result can be obtained using only a geometric progression as follows: 

At the end of year 1, the total amount will be C + Cr = C( 1 + r). 

At the end of year 2, the total amount will be C( 1 + r) + C(1 + r)r — (7(1 + r) 2 . 

At the end of year 3, the total amount will be C( 1 + r) 2 + C( 1 + r) 2 r = C( 1 + r) 3 . 

At the end of year n, the total amount will be C( 1 + r) n . 

Thus the total interest earned will be C( 1 + r) n — C = C((l + r) n — 1) as before. 
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The sum to infinity of a geometric series. 

In a geometric series with n terms, suppose that the value of the common ratio, r, is numer- 
ically less than 1. Then the higher the value of n, the smaller the numerical value of r n , to 
the extent that, as n approaches infinity, r n approaches zero. 

We conclude that, although it not possible to reach the end of a geometric series which has 
an infinite number of terms, its sum to infinity may be given by 


EXAMPLES 

1. Determine the sum to infinity of the geometric series 


Solution 

The sum to infinity is 



25 

~ 6 ~ 


~ 4.17 


2. The yearly output of a silver mine is found to be decreasing by 25% of its previous 
year’s output. If, in a certain year, its output was £25,000, what could be reckoned as 
its total future output ? 

Solution 

The total output, in pounds, for subsequent years will be given by 

25000 x 0.75 + 25000 x (0.75) 2 + 25000 x (0.75) 3 + ... = 25000 x °- 75 = 75000. 

v ; v ’ 1 - 0.75 


2.1.5 MORE GENERAL PROGRESSIONS AND SERIES 
Introduction 

Not all progressions and series encountered in mathematics are either arithmetic or geometric. 
For instance: 


l 2 , 2 2 , 3 2 , 4 2 , , 


.,n 
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has a clearly defined pattern but is not arithmetic or geometric. 

An arbitrary progression of n numbers which conform to some regular pattern is often 
denoted by 


^1? ^2 ) ^4 5 ) 


and it may or may not be possible to find a simple formula for the sum S n . 

The Sigma Notation (£). 

If the general term of a series with n terms is known, then the complete series can be written 
down in short notation as indicated by the following illustrations: 

1 . 

n 

cl T (o d) “f (o T 2d) T T (n T \ji — l]d) — ) ) (o T [r — l]d). 

r= 1 

2. 

n 

a + ar + ar 2 + ar n_1 = 53 ar k ~ l . 

k= 1 

3. 

n 

l 2 + 2 2 + 3 2 + n 2 = Y J r 2 . 

r= 1 

4. 

n 

-l 3 4 + 2 3 - 3 3 + 4 3 + (— l) n n 3 = 53 (-l) r r 3 . 

r=l 


Notes: 

(i) It is sometimes more convenient to count the terms of a series from zero rather than 1. 
For example: 


n— 1 

..a + [n — 1 ]d = 53 ( a + rc 0 

r = 0 
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a + + d) + (ci + 2d) + 



and 


n— 1 

a + ar + ar 2 + ar 3 + ar n_1 = ^ ar k . 

k=0 

In general, for a series with n terms starting at Uq, 

n — 1 

Uo + Ml + U 2 + u 3 + + M„-l = U r- 

r=0 


(ii) We may also use the sigma notation for “infinite series” such as those we encountered 
in the sum to infinity of a geometric series. For example 


1 i 1 
1 + 3 + 32 + iF + 


°° 1 

V T or 

' Qr — 1 
r= 1 6 


E 


r=0 


1 

3 r 


STANDARD RESULTS 

It may be shown that 


n 2 

E r = o n ( n + !). 

r = 1 Z 


and 


n 

r 2 = -n(n + l)(2n + 1) 
“I 6 


E 


■i 

.2 


n(n + 1) 


n2 


Outline Proofs: 

The first of these is simply the formula for the sum of an arithmetic series with first term 1 
and last term n. 
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The second is proved by summing, from r = 1 to n, the identity 

(r + l) 3 — r 3 = 3r 2 + 3r + 1. 

The third is proved by summing, from r = 1 to n, the identity 

(r + l) 4 — r 4 = 4r 3 + 6r 2 + 4r + 1. 

EXAMPLE 

Determine the sum to n terms of the series 


1-2-3 + 2- 3- 4 + 3- 4- 5 + 4- 5- 6 + 5- 6- 7 + 


Solution 

The series is 


n n n n n 

y r(r + l)(r + 2) = ^ r 3 + 3r 2 + 2r = ^ r 3 + 3 ^ r 2 + 2 ^ r. 

r = 1 r= 1 r = 1 r=l r= 1 


Using the three standard results, the summation becomes 


i 2 


-n(n + 1) 


+ 3 


-n(n + l)(2n + 1) 
o 


+ 2 


-n(n + 1) 


= ^n(n + 1) [n(n + 1) + An + 2 + 4] = ^n(n + 1) [n 2 + 5n + 6]. 


This simplifies to 


-n{n + l)(n + 2 ){n + 3). 


10 



2.1.6 EXERCISES 


1. Write down the next two terms and also the n-tli term of the following sequences of 
numbers which are either arithmetic progressions or geometric progressions: 

(a) 40, 29, 18, 7, 

(b) f , f , 7, 

(c) 5, 15, 45, 

(d) 10, 9.2, 8.4, ...; 

(e) 81, -54, 36, ...; 

(f) - - — 

2. The third term of an arithmetic series is 34 and the 17th term is —8. Find the sum of 
the first 20 terms. 

3. For the geometric series 1 + 1.2 + 1.44 + ..., find the 6th term and the sum of the first 
10 terms. 

4. A parent places in a savings bank £25 on his son’s first birthday, £50 on his second, £75 
on his third and so on, increasing the amount by £25 on each birthday. How much will 
be saved up (apart from any accrued interest) when the boy reaches his 16th birthday 
if the final amount is added on this day ? 

5. Every year, a gardner takes 4 runners from each of his one year old strawberry plants 
in order to form 4 additional plants. If he starts with 5 plants, how many new plants 
will he take at the end of the 6th year and what will then be his total number of 
plants ? 

6. A superball is dropped from a height of 10m. At each rebound, it rises to a height 
which is 90% of the height from which it has just fallen. What is the total distance 
through which the ball will have moved before it finally comes to rest ? 

7. Express the series 

2 4 6 8 

— I 1 1 b nterrns 

3 5 7 9 

in both of the forms 

n n— 1 

y u r and ^2 u r ■ 

r= 1 r=0 

Hint: 

Find the pattern in the numerators and denominators separately 
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8. Determine the sum to n terms of the series 

1-3 + 2-5 + 3-7 + 4-9 + . . . 

2.1.7 ANSWERS TO EXERCISES 


1. (a) -4, -15; 51 - lln; 

( b ) 8|, 9|; |(9 + 4 n); 

(c) 135, 405; 5(3) n " 1 ; 


(d) 7.6, 6.8; 10.8 - 0.8n; 

(e) -24, 16; (_ 1 )n-l 2 n-l 3 5-n. 

/f \ _9_ _27_ . 3 n ~ 2 

v 1 / 64’ 256’ 4 n— 1 ’ 


2. a = 40, d = -3, S 20 = 230. 

3. 6th term = (1.2) 5 ~ 2.488 and S l0 ~ 25.96 

4. £3400. 

5. Number at year 6 = 5 x 4 6 = 20480; Total = 27305. 


6. Total distance 

7. 


10 + 


2x10x0.9 


1-0.9 


190m. 


E 


2 r 

2 r + 1 


and 


n— 1 


E 


2 (r + 1) 
2r + 3 


1 

6 


n(n + l)(4n + 5). 
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UNIT 2.2 - SERIES 2 - BINOMIAL SERIES 


INTRODUCTION 

In this section, we shall be concerned with the methods of expanding (multiplying out) an 
expression of the form 

(A + B) n , 

where A and B are either mathematical expressions or numerical values, and n is a given 
number which need not be a positive integer. However, we shall deal first with the case when 
n is a positive integer, since there is a useful aid to memory for obtaining the result. 

2.2.1 PASCAL’S TRIANGLE 

Initially, we consider some simple illustrations obtainable from very elementary algebraic 
techniques in earlier work: 

1. (A + B) 1 = 

A + B. 

2. (A + B) 2 = 

A 2 + 2AB + B 2 . 

3. (A + B) 3 = 

A 3 + 3 A 2 B + 3AB 2 + B 3 . 

4. (A + B) 4 = 

A 4 + 4 A 3 B + 6 A 2 B 2 + 4 AB 3 + B 4 . 

OBSERVATIONS 

(i) We notice that, in each result, the expansion begins with the maximum possible 
power of A and ends with the maximum possible power of B. 

(ii) In the sequence of terms from beginning to end, the powers of A decrease in steps 
of 1 while the powers of B increase in steps of 1. 
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(iii) The coefficients in the illustrated expansions follow the diagramatic pattern called 

PASCAL’S TRIANGLE: 

1 1 
12 1 
13 3 1 

1 4 6 4 1 

and this suggests a general pattern where each line begins and ends with the number 1 
and each of the other numbers is the sum of the two numbers above it in the previous 
line. For example, the next line would be 

1 5 10 10 5 1 


giving the result 

5. ( A + B ) 5 = 

A 5 + 5 A 4 B + 10 A 3 B 2 + 10 A 2 B 3 + 5AB 4 + B 5 . 

(iv) The only difference which occurs in an expansion of the form 

(A - B) n 

is that the terms are alternately positive and negative. For instance, 

6. ( A-B ) 6 = 

A 6 - 6 A 5 B + 15 A 4 B 2 - 20 A 3 B 3 + 15 A 2 B 4 - 6 AB 5 + B 6 . 


2.2.2 BINOMIAL FORMULAE 

In (A + B) n , if n is a large positive integer, then the method of Pascal’s Triangle can become 
very tedious. If n is not a positive integer, then Pascal’s Triangle cannot be used anyway. 

A more general method which can be applied to any value of n is the binomial formula 
whose proof is best obtained as an application of differential calculus and hence will not be 
included here. 

Before stating appropriate versions of the binomial formula, we need to introduce a standard 
notation called a “factorial” by means of the following definition: 
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DEFINITION 

If n is a positive integer, the product 


1.2.3.4.5 n 

is denoted by the symbol n\ and is called “n factorial”. 

Note: 

This definition could not be applied to the case when n — 0, but it is convenient to give a 
meaning to 0! We define it separately by the statement 

0 ! = 1 

and the logic behind this separate definition can be made plain in the applications of calculus. 
There is no meaning to n\ when n is a negative integer. 

(a) Binomial formula for {A + B) n when n is a positive integer. 

It can be shown that 

(A + B) n = A n + noA^B + n ^ n ~ ^ A n ~ 2 B 2 + W ( W ~ ^ ~ ^ A n ~ 3 B 3 + + B n . 


Notes: 

(i) This is the same as the result which would be given by Pascal’s Triangle. 

(ii) The last term in the expansion is really 

n(n — l)(n — 2)(n — 3) 3.2.1 


_ A n-n B n = R n = 


n\ 


(iii) The coefficient of A n r B r in the expansion is 

n(n — 1 )(n — 2) (n — 3) (n — r + 1) 


n\ 

( n — r)!r! 


and this is sometimes denoted by the symbol 


(iv) A commonly used version of the result is given by 

^ ^ n(n — 1) 2 n(n-l)(n-2) s 

(1 + x) = 1 + nx 3 — — -x 2 H — -x A + ... + x . 


2 ! 


3! 
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EXAMPLES 


1. Expand fully the expression (1 + 2a;) 3 . 

Solution 

We first note that 

(A + Bf = A 3 + 3 A 2 B + B 2 + B 3 = A 3 + 3 A 2 B + 3 AB 2 + B 3 . 

If we now replace A by 1 and B by 2x, we obtain 

(1 + 2a;) 3 = 1 + 3(2x) + 3(2a;) 2 + (2a;) 3 = 1 + 6a; + 12a; 2 + 8a; 3 . 

2. Expand fully the expression (2 — a;) 5 . 

Solution 

We hrst note that 

(A + Bf = A 5 + 5 A 4 B + 5 4a 3 B 2 + 5 ^ 3 -A 2 B 3 + AB 4 + B 5 . 
v 2! 3! 4! 

That is, 

(A + Bf = A 5 + 5 A a B + 10A 3 5 2 + 10A 2 5 3 + 5AR 4 + B 5 . 

We now replace A by 2 and B by — x to obtain 

(2 - a;) 5 = 2 5 + 5(2) 4 (-a;) + 10(2) 3 (-a;) 2 + 10(2) 2 (-a;) 3 + 5(2)(-a;) 4 + (-a;) 5 . 
That is, 


(2 - a;) 5 = 32 - 80a; + 80a; 2 - 40a; 3 + 10a; 4 - x 5 . 


(b) Binomial formula for (A + Bf when n is negative or a fraction. 

It turns out that the binomial formula for a positive integer index may still be used when 
the index is negative or a fraction, except that the series of terms will be an infinite series. 
That is, it will not terminate. 

In order to state the most commonly used version of the more general result, we use the 
simplified form of the binomial formula in Note (iii) of the previous section: 

RESULT 


If n is negative or a fraction and x lies strictly between x — — 1 and x — 1, it can be shown 
that 

_ ,,, n(n— 1) 2 n(n-l)(n-2) , 

(1 + x) = 1 + nx + — -x 2 + hr -x + 


2 ! 


3! 
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EXAMPLES 


1. Expand (1 + x)* as far as the term in x 3 . 

Solution 


i , 1 

2 : 


i/i 


(1 + x)3 = 1 + Ar + — -x 2 + — 




2 3 

X X X° 

— l — — — — t — — 

2 8 16 

provided — 1 < x < 1. 

2. Expand (2 — x)~ 3 as far as the term in a; 3 stating the values of x for which the series is 
valid. 

Solution 

We first convert the expression (2 — x)~ 3 to one in which the leading term in the bracket 
is 1. That is, 


(2 -x)~ 3 = 


2(1 — — 
2 


-3 


= X 1 + 


-3 


The required binomial expansion is thus: 

1 Fl + <-3) (-1) + ( - 3>( - 3 - 3! 1)( - 3 - 2) 


x\ 3 

“2 + 


That is, 


3x 3x 2 5x 3 

1 + y + — + — + 


The expansion is valid provided that — x/2 lies strictly between —1 and 1. This will be 
so when x itself lies strictly between —2 and 2. 
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(c) Approximate Values 

The Binomial Series may be used to calculate simple approximations, as illustrated by the 
following example: 

EXAMPLE 

Evaluate a/ 1.02 correct to five places of decimals. 

Solution 

Using 1.02 = 1 + 0.02, we may say that 

VUY2 = (1 + 0.02)3. 


That is, 


i 1 (- 1 -) 

Vim = i + -(0.02) + 2 y 2 2y 


( 0 . 02) 2 + 


iH)(- 


1.2.3 


( 0 . 02) 3 + 


= 1 + 0.01 - -(0.0004) + —(0.000008) 
8 16 


= 1 + 0.01 - 0.00005 + 0.0000005 - . . . 


~ 1.010001 - 0.000050 = 1.009951 


Hence Vim ~ 1.00995 
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2.2.3 EXERCISES 


1. Expand the following, using Pascal’s Triangle: 


(b) 


(d) 


(1 + x) 4 5 * ; 
{;x + yf] 
{x-y) 7 \ 
(x — l) 8 . 


2. Use the result of question 1(a) to evaluate 

(1.01) 5 

without using a calculator. 

3. Expand fully the following expressions: 

(a) 


(2x - l) 5 ; 



4. Expand the following as far as the term in a; 3 , stating the values of x for which the 

expansions are valid: 

(a) 

(3 + x) 
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(b) 

(l«2x)3; 

(c) 

(2 + a;)” 4 . 

5. Using the first four terms of the expansion for (1 + x) n , calculate an approximate value 
of vTI, stating the result correct to five significant figures. 

6. If x is small, show that 

/ . i , ,i 3a: 2 

(1 + x) — (1 — 2a;) 2 ~ 

2.2.4 ANSWERS TO EXERCISES 


1. (a) 

(b) 

(c) 

(d) 


1 + 5a: + 10a: 2 + 10a: 3 + 5a: 4 + a: 5 ; 
x 6 + 6 x 5 y + 15a: 4 ?/ 2 + 20 x 3 y 3 + 15a: 2 ?/ 4 + 6 xy 5 + y 6 ; 
x 7 — 7 x 6 y + 21a: 5 y 2 — 35 x 4 y 3 + 35a: 3 ?/ 4 — 21 x 2 y 5 + 7xy 6 — y 7 ; 
x 8 — 8x 7 + 28a: 6 — 56a: 5 + 70a: 4 — 56a: 3 + 28a: 2 — 8a: + 1. 


2. 1.0510100501 to ten places of decimals. 

3. (a) 

32a: 5 - 80a: 4 + 80a: 3 - 40a: 2 + lOx - 1; 

(b) 

27 , 3 , 1 , 

81 + 54a: + —x + -x s + — a: ; 

2 2 16 


x 



4. (a) 



provided — 2 < x < 2. 

5. 1.0488 

6. Expand each bracket as far as the term in x 2 . 


5 
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UNIT 2.3 - SERIES 3 - ELEMENTARY CONVERGENCE AND DIVERGENCE 


Introduction 

In the examination of geometric series in Unit 2.1 and of binomial series in Unit 2.2, the 
idea was introduced of series which have a first term but no last term; that is, there are an 
infinite number of terms. 

The general format of an infinite series may be specified by either 


OO 

Ml + U 2 + U 3 + . . . = U r 

r = 1 


or 

OO 

Mo + Ml + U 2 + • • • ll r . 

r = 0 


In the first of these two forms, u r is the r-th term while, in the second, u r is the (r + l)-th 
term. 

ILLUSTRATIONS 


1. 


2 . 


3. 


1 1 

1 + 2 + 3 + ' ' 


00-1 OO | 

• = £; = £ 


si > Sr, r + l 


OO OO 


2 + 4 + 6 + 8 + . . . = Y, 2r = Y, 2(r + 1). 


r= 1 r = 0 


1 + 3 + 5 + 7 + . . . = y (2r - 1) = Y. (2r + 1). 


r — 1 


r=0 


2.3.1 THE DEFINITIONS OF CONVERGENCE AND DIVERGENCE 

It has already been shown in Unit 2.1 (for geometric series) that an infinite series may have 
a “sum to infinity” even though it is not possible to reach the end of the series. 
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For example, the infinite geometric series 


1 

2 


+ 




= E 


1 

2 r 


is such that the sum, S n , of the first n terms is given by 

1(1 1 
" 1 -| 2 # 

As n becomes larger and larger, S n approaches ever closer to the fixed value, 1. 

We say that the “limiting value” of S n as n “tends to infinity” is 1; and we write 

lim S n = 1. 

n— >oo 

Since this limiting value is a finite number, we say that the series “converges” to 1. 

DEFINITION (A) 

For the infinite series 

OO 

r = 1 


the expression 


U 1 + u 2 + u 3 + • • • + u n 


is called its “n-th partial sum”. 

DEFINITION (B) 

If the n-th partial sum of an infinite series tends to a finite limit as n tends to infinity, the 
series is said to “converge”. In all other cases, the series is said to “diverge”. 
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ILLUSTRATIONS 


1. 


2 . 


3. 



OO 

= ^ — converges. 

r= 1 2 " 


OO 

1 + 2 + 3 + 4 + . . . . = y diverges. 

r= 1 


oo 

1 - 1 + 1 - 1 + .... = 53 (-i)”' 1 diverges. 

r=l 


Notes: 

(i) The third illustration above shows that a series which diverges does not necessarily diverge 
to infinity. 

(ii) Whether a series converges or diverges depends less on the starting terms than it does 
on the later terms. For example 


7 - 15 + 2 + 39 + 


1 1 
2 + 4 + 



1 

16 


+ . . 


converges to 7 — 15 + 2 + 39 + 1 = 33 + 1 = 34. 

(iii) It will be seen, in the next section, that it is sometimes possible to test an infinite series 
for convergence or divergence without having to try and determine its sum to infinity. 

2.3.2 TESTS FOR CONVERGENCE AND DIVERGENCE 

In this section, the emphasis will be on the use of certain standard tests, rather than on 
their rigorous formal proofs. Only outline proofs will be suggested. 

To begin with, we shall consider series of positive terms only. 
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TEST 1 - The r-th Term Test 


An infinite series, 


OO 
r = 1 


cannot converge unless its terms ultimately tend to zero; that is, 

lim u r = 0. 

r— >oo 


Outline Proof: 

The series will converge only if the r-th partial sums, S r , tend to a finite limit, L (say), as 
r tends to infinity; hence, if we observe that u r = S r — SV-i, then u r must tend to zero as r 
tends to infinity since S r and S r _i each tend to L. 

ILLUSTRATIONS 


1. The convergent series 


E 


i 


discussed earlier, is such that 


lim 

r— >oo 


1 

2 r 


0. 


2. The divergent series 


E r > 


discussed earlier, is such that 



3. The series 


is such that 



1 

lim - = 0, 

r— >00 


but it will be shown later that this series is divergent. 

That is, the converse of the r-th Term Test is not true. It does not imply that a series 
is convergent when its terms do tend to zero; merely that it is divergent when its terms 
do not tend to zero. 


TEST 2 - The Comparison Test 

Given the series 


OO 

u r 1 

r=l 

suppose that 

OO 

X V r 

r = 1 


is a second series which is known to converge. 

Then the first series converges provided that u r < v r . 

Similarly, if 


OO 

X Wr 

r = 1 


is a series which is known to diverge, then the first series diverges provided that u r > w r . 

Note: 

It may be necessary to ignore the first few values of r. 
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Outline Proof: 


Suppose we think of u r and v r as the heights of two sets of rectangles, all with a common 
base-length of one unit. 

If the series 

OO 

X l ’r 

r=l 

is convergent it represents a finite total area of an infinite number of rectangles. 

The series 

OO 

X U r 

r = 1 

represents a smaller area and, hence, is also finite. 

A similar argument holds when 

OO 

X W r 

r= 1 

is a divergent series and u r > w r . 

A divergent series of positive terms can diverge only to +oo so that the set of rectangles 
determined by u r generates an area that is greater than an area which is already infinite. 

EXAMPLES 



is divergent. 

Solution 

The given series may be written as 
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1 1 


l + V+ V + T + V + - + = + V + 


1111 


3 4/ \5 6 7 

a series whose terms are all greater than (or, for the second term, equal to) 
But the series 

111 

1 + 2 + 2 + 2 + 


is a divergent series and, hence, the series 


is divergent. 
2. Given that 


is a convergent series, show that 


OO 1 

E ; 

r = 1 ' 


oo 1 

E i 


r = 1 


y — - — 

r(r + 1) 


is also a convergent series. 

Solution 

First, we observe that, for r = 1, 2, 3, 4, . . ., 


< 1 _ 1 


r(r + 1) r.r r 


Hence, the terms of the series 


OO 1 

y — - — 

r(r + 1) 

are smaller in value than those of a known convergent series. It therefore converges 
also. 
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Note: 

It may be shown that the series 


OO 1 

E - 

r=l rP 

is convergent whenever p > 1 and divergent whenever p < 1. This result provides a useful 
standard tool to use with the Comparison Test. 

TEST 3 - D’Alembert’s Ratio Test 

Given the series 

OO 

El u ri 
r = 1 

suppose that 

u r + 1 j 

lim = L: 

r ^°° u r 

Then the series converges if L < 1 and diverges if L > 1. 

There is no conclusion if L — 1. 

Outline Proof: 

(i) If L > 1, all the values of will ultimately be greater than 1 and so u r+ 1 > u r for a 
large enough value of r. 

Hence, the terms cannot ultimately be decreasing; so Test 1 shows that the series diverges. 

(ii) If L < 1, all the values of will ultimately be less than 1 and so u r+ \ < u r for a 
large enough value or r. 

We will consider that this first occurs when r = s; and, from this value onwards, the terms 
steadily decrease in value. 

Furthermore, we can certainly find a positive number, h, between L and 1 such that 



^s+1 

U s 


/ / Us + 2 
< h, 

' u s+ 1 


, m s+3 

< h, 

U s +2 


< h, . . . 


That is, 


u s+ i hu s , u s ~\~ 2 hu s -\~ i, ^s+3 hu s + 2 , . . 


which gives 


«s+l < /Z-U s , M s+2 < /l 2 M s , M s +3 < • • • 

But, since L < h < 1, 


hu s + h 2 u s + h 3 u s + . . . 


is a convergent geometric series; therefore, by the Comparison Test, 

OO 

u s+ 1 + u s+ 2 + u s+ 3 + • • • = 51 “*+»■ converges, 

r=l 


implying that the original series converges also. 

(iii) If L — 1, there will be no conclusion since we have already encountered examples of 
both a convergent series and a divergent series which give L — 1. 

In particular, 


E 


i 

r 


is divergent and gives 


'Uj.-i-i 

inn 

r->°° u r 


lim 

r—> oo 


r 

r + 1 


lim 

r— >oo 


l 

IT 1 

r 


l. 
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Also, 


E 


i 


is convergent and gives 


U r . |_i 

Inn 

r ^°° u r 


lim 

r— > oo 


(r + l ) 2 




1. 


Note: 

A convenient way to calculate the limit as r tends to infinity of any ratio of two polynomials 
in r is first to divide the numerator and the denominator by the highest power of r. 

For example, 


3 + yj _ 3 

2 + yj 2 

ILLUSTRATIONS 

1. For the series 

OO OO 

E U r = E nr’ 

r= 1 r= 1 Z 

u r+ 1 r + 1 T r + 1 
u r 2 r+1 r 2 r 


lim 


3r s + 1 


oo 2r 3 + 1 


= lim 


Thus, 


r u r + 1 r + 1 1 + r 

inn = lim = lim - 

oo r— > oo 2 T r^oo 2 

The limiting value is less than 1 so that the series converges. 


1 

2' 
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2. For the series 


£ <v = £ 2 r , 

r = 1 r= 1 

«r+l = 2 r+1 = 2 
W r 2 r 


Thus, 


lim 1+1 = lim 2 = 2. 

r— >oo ^ r^oo 

The limiting value is greater than 1 so that the series diverges. 

2.3.3 EXERCISES 


1. Use the “r-th Term Test” to show that the following series are divergent: 


(a) 


E 


r 

r + 2’ 


(b) 


E 


1 + 2r 2 
1 + r 2 


2. Use the “Comparison Test” to determine whether the following series are convergent or 
divergent: 

(a) 


E 


1 

(r + l)(r + 2) ’ 


(b) 


E 


r 

a / r 6 + 1 ’ 


11 



r 


E 


r 2 + 1 


3. Use D’Alembert’s Ratio Test to determine whether the following series are convergent 
or divergent: 


(a) 


(b) 


E 


2 r 
^2 ’ 


E 


i 

(2r + 1)!’ 


(c) 


E 


r + 1 


4. Obtain an expression for the r-th term of the following infinite series and, hence, inves- 
tigate them for convergence or divergence: 

(a) 

1 , 1 , 1 , 1 

1 x 2 + 2 x 2 2 + 3 x 2 3 + 4 x 2 4 + ' ' ’ 

(b) 

3 7 15 31 

+ 2x4 + 4x9 + 8 x 16 + 16 x 25 + ' ' 

(c) 

1 , 1 , 1 1 1 
v / 3-12- v / 2V5-v / 3v / 6-2 + v / 7-v / 5 
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2.3.4 ANSWERS TO EXERCISES 


1. (a) 

lim 1+1 = 1^0; 

r ^°° u r 

(b) 

lim 1+1 =2^0. 

u r 

2. (a) Convergent; 

(b) Convergent; 

(c) Divergent. 

3. (a) Divergent; 

(b) Convergent; 

(c) Convergent. 

4. (a) 

1 

u r = ; 

r x 2 r 

The series is convergent by D’Alembert’s Ratio Test 

(b) 

_ 2 r - 1 
Ur ~ 

The series is convergent by Comparison Test; 

(c) 

1 

u r = , :=; 

a Jr + 2 - yjr 

The series is divergent by r-th Term Test. 


Note: 

For further discussion of limiting values, see Unit 10.1 
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UNIT 2.4 - SERIES 4- FURTHER CONVERGENCE AND DIVERGENCE 


2.4.1 SERIES OF POSITIVE AND NEGATIVE TERMS 
Introduction 

In Units 2.2 and 2.3, most of the series considered have included only positive terms. But 
now we shall examine the concepts of convergence and divergence in cases where negative 
terms are present. 

We note here, for example, that the r-th Term Test encountered in Unit 2.3 may be used 
for series whose terms are not necessarily all positive. This is because the formula 


u r 


S r — S r _ i 


is valid for any series. 

The series cannot converge unless the partial sums S r and SV-i both tend to the same finite 
limit as r tends to infinity which implies that u r tends to zero as r tends to infinity. 

A particularly simple kind of series with both positive and negative terms is one whose terms 
are alternately positive and negative. The following test is applicable to such series: 

Test 4 - The Alternating Series Test 

If 

u 4 — u 2 + u 3 — u 4 + . . , where u r > 0, 


is such that 


u r > u r . |_i and u r — > 0 as r — > oo, 


then the series converges. 

Outline Proof: 

(a) Suppose we re-group the series as 


Oi - u 2 ) + (■ u 3 - u 4 ) + (■ u 5 -u 6 )+ . . .; 


then, it may be considered in the form 


1 



oo 

E W 

r = 1 


where V\ = u\ - u 2 , v 2 — u 3 - u 4 , v 3 = u 5 - u 6 , 


This means that v r is positive, so that the corresponding r-th partial sums, S r = v\ + v 2 + 
v 3 + ... +v r , steadily increase as r increases. 

(b) Alternatively, suppose we re-group the series as 


Ml - («2 - M 3 ) - (m 4 - M 5 ) - (u 6 - u 7 ) - . . . ; 


then, it may be considered in the form 


OO 

M 1-J2 W r, 
r = 1 


where w\ = u 2 — u 3 , w 2 = u 4 — u$, w 3 = uq — U7, .... 

In this case, each partial sum, — (wi + tc 2 + m ; 3 + . . . + w r ) is less than u 1 since 

positive quantities are being subtracted from it. 

(c) We conclude that the partial sums of the original series are steadily increasing but are 
never greater than U\ . They must therefore tend to a finite limit as r tends to infinity; that 
is, the series converges. 

ILUSTRATION 

The series 


E 


(-lr 1 


1 

r 



1 

4 


+ . . . 


is convergent since 


1 1 
r r + 1 


and 


0 as r — >• 00. 


2 



2.4.2 ABSOLUTE AND CONDITIONAL CONVERGENCE 


In this section, a link is made between a series having both positive and negative terms and 
the corresponding series for which all of the terms are positive. 

By making this link, we shall be able to make use of earlier tests for convergence and 
divergence. 

DEFINITION (A) 

If 

OO 

Z U r 

r = 1 

is a series with both positive and negative terms, it is said to be “absolutely convergent” 
if 


Z kl 

r = 1 

is convergent. 

DEFINITION (B) 

If 

OO 

Z U r 

r = 1 

is a convergent series of positive and negative terms, but 

OO 

Z kl 

r= 1 

is a divergent series, then the first of these two series is said to be “conditionally conver- 
gent” . 

ILLUSTRATIONS 

1. The series 

11111 
+ 2 2 ~ 3 2 ~ 4 2 + 5 2 + & 2 + ■■■ 
converges absolutely since the series 

11111 
+ 22 + 32 + k + 52 + k + ■■■ 

converges. 
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2. The series 


1 



is conditionally convergent since, although it converges (by the Alternating Series Test), 
the series 


1111 
1+ 2 + 3 + i + 5 + "■ 


is divergent. 


Notes: 

(i) It may be shown that any series of positive and negative terms which is absolutely 
convergent will also be convergent. 

(ii) Any test for the convergence of a series of positive terms may be used as a test for the 
absolute convergence of a series of both positive and negative terms. 

2.4.3 TESTS FOR ABSOLUTE CONVERGENCE 

The Comparison Test 

Given the series 

OO 

r = 1 

suppose that \u r \ < v r where 

OO 

E v r 

r= 1 

is a convergent series of positive terms. Then, the given series is absolutely convergent. 

D’Alembert’s Ratio Test 

Given the series 

OO 

Ur i 

r = 1 

suppose that 


lim 

r— >oo 


w r _i_i 

u r 


L. 
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Then the given series is absolutely convergent if L < 1. 

Note: 

If L > 1, then |w r+1 | > \u r \ for large enough values of r showing that the numerical values 
of the terms steadily increase. This implies that u r does not tend to zero as r tends to 
infinity and, hence, by the r-th Term Test, the series diverges. 

If L — 1, there is no conclusion. 

EXAMPLES 

1. Show that the series 

1 1 1 | 1 1 1 
1x2 ~~ 2x3 _ 3x4 + 4x5 ~~ 5x6 ~ 6x7 + 

is absolutely convergent. 

Solution 

The r-th term of the series is numerically equal to 

1 

r(r + 1) ’ 

which is always less than the r-th term of a known convergent series. 

2. Show that the series 


1 

2 




is conditionally convergent. 

Solution 

The r-th term of the series is numerically equal to 

r 

r 2 + 1 ’ 


which tends to zero as r tends to infinity. 
Also, 


r r + 1 

r 2 + 1 (r + l) 2 + 1 
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since this may be reduced to the true statement r 2 + r > 1. 

Hence, by the Alternating Series Test, the series converges. 

However, it is also true that 

r r 1 

ry * 2 j ^ y>2 j y» ry* j ^ ^ 

and, hence, by the Comparison Test, the series of absolute values is divergent, since 

OO 1 

y — 

r=\ r + ! 

is divergent. 

2.4.4 POWER SERIES 

A series of the form 

OO OO 

ao + a\x + CL 2 X 2 + CI 3 X 3 + . . —'y a r x r or 'y a r - \x r ~ 1 , 

r = 0 r=l 


where x is usually a variable quantity, is called a “power Series in x with coefficients 

&O5 ^1} ^2 1 ^35 • • • 


Notes: 

(i) In this kind of series, it is particularly useful to sum the series from r = 0 to infinity 
rather than from r = 1 to infinity so that the constant term at the beginning (if there is 
one) can be considered as the term in x°. 

But the various tests for convergence and divergence still apply in this alternative notation. 

(ii) A power series will not necessarily be convergent (or divergent) for all values of x and 
it is usually required to determine the specific range of values of x for which the series 
converges. This can most frequently be done using D’Alembert’s Ratio Test. 

ILLUSTRATION 

For the series 


x — 




= £ (-D 


r — 1 


X 


r = 1 
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we have 


U r +l 


(-l) r x r+1 

r 


r 

u r 


r + 1 

(-l) r - 1 x r 


r + 1 


x 


which tends to x as r tends to infinity. 

Thus, the series converges absolutely when |x| < 1 and diverges when x > 1. 
If x = 1, we have the series 


1 - 




* > 


which converges by the Alternating Series Test; while, if x = —1, we have the series 


-1 


1 

2 


1 

3 


1 

4 


* * 1 


which diverges. 

The precise range of convergence for the given series is therefore — 1 < x < 1. 


2.4.5 EXERCISES 

1. Show that the following alternating series are convergent: 
(a) 


111 
1_ 2 + 4“8 + 

(b) 

12 3 4 

IE _ 33 + 34 ~35 + ‘ ‘ 

2. Show that the following series are conditionally convergent: 
(a) 


111 
+ ' ' ; 


7 



(b) 


5 


1x3 2x4 3x5 4x6 

3. Show that the following series are absolutely convergent: 
(a) 


(b) 


1 

3 


3 4 15 16 1 

— I X X — x— + 

2 3 2 4 2 2 5 2 3 


1x2 1x2x3 Ix2x3x4 


3x5 3x5x7 3 x 5 x 7 x 9 


4. Obtain the precise range of values of x for which each of the following power series is 
convergent: 


(a) 


(b) 


2 S 4 

X *" 1 CC CC 

1 + X+ 2! + ¥ + 4! + 


x 


X 


(c) 


(d) 


+ 


x 


+ 


x 


1x2 2x3 3x4 4x5 


„ 3a: 2 4a: 3 5a: 4 

+ ^ + ^ + i^ + 


2x 3x 2 4a: 3 

H 1 1 h 

5 25 125 


+ 



2.4.6 ANSWERS TO EXERCISES 


1. (a) Use u r = 2r L i (numerically); 

(b) Use u r = 7 ^+y (numerically). 

2. (a) Use u r = ^ (numerically); 

(b) Use u r = (numerically). 

3. (a) Use u r = x - (numerically); 

(b) Use = ^r+i)! (numerically). 

4. (a) The power series converges for all values of x 

(b) — 1 < x < 1; 

(c) — 1 < x < 1; 

(d) — 5 < x < 5. 

Note: 

For further discussion of limiting values, see Unit 10.1 
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UNIT 3.1 - TRIGONOMETRY 1 

ANGLES AND TRIGONOMETRIC FUNCTIONS 

3.1.1 INTRODUCTION 


The following results will be assumed without proof: 

(i) The Circumference, (7, and Diameter, D, of a circle are directly proportional to each 
other through the formula 

C = nD 


or, if the radius is r, 


C = 2i rr. 

(ii) The area, A, of a circle is related to the radius, r, by means of the formula 

A = nr 2 . 


3.1.2 ANGULAR MEASURE 

(a) Astronomical Units 

The “degree” is a g^th part of one complete revolution. It is based on the study of 
planetary motion where 360 is approximately the number of days in a year. 

(b) Radian Measure 

A “radian” is the angle subtended at the centre of a circle by an arc which is equal in 
length to the radius. 



RESULTS 

(i) Using the definition of a radian, together with the second formula for circumference on 
the previous page, we conclude that there are 27T radians in one complete revolution. That 
is, 2n radians is equivalent to 360° or, in other words n radians is equivalent to 180°. 

(ii) In the diagram overleaf, the arclcngth from A to B will be given by 

Q 

— x 2n r = r6 , 

2vr 

assuming that 9 is measured in radians. 
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(iii) In the diagram below, the area of the sector ABC is given by 


6 2 1 2 

— x nr = -r 9. 
2n 2 



(c) Standard Angles 

The scaling factor for converting degrees to radians is 

n 

180 

and the scaling factor for converting from radians to degrees is 

180 

n 


These scaling factors enable us to deal with any angle, but it is useful to list the expression, 
in radians, of some of the more well-known angles. 

ILLUSTRATIONS 


1. 15° is equivalent to ^ x 15 = 

2. 30° is equivalent to ^ x 30 = |. 

3. 45° is equivalent to ^ x 45 = |. 

4. 60° is equivalent to ^ x 60 = |. 

5. 75° is equivalent to ^ x 75 = ||. 

6. 90° is equivalent to ^ x 90 = |. 

(d) Positive and Negative Angles 


For the measurement of angles in general, we consider the plane of the page to be divided 
into four quadrants by means of a cartesian reference system with axes Ox and O y. The 
“first quadrant” is that for which x and y are both positive, and the other three quadrants 
are numbered from the first in an anticlockwise sense. 


2 



f 

0 — ^ 


From the positive ^-direction, we measure angles positively in the anticlockwise sense and 
negatively in the clockwise sense. Special names are given to the type of angles obtained as 
follows: 

1. Angles in the range between 0° and 90° are called “positive acute” angles. 

2. Angles in the range between 90° and 180° are called “positive obtuse” angles. 

3. Angles in the range between 180° and 360° are called “positive reflex” angles. 

4. Angles measured in the clockwise sense have similar names but preceded by the word 

“negative” . 

3.1.3 TRIGONOMETRIC FUNCTIONS 

We first consider a right-angled triangle in one corner of which is an angle # other than 
the right-angle itself. The sides of the triangle are labelled in relation to this angle , 9 , as 
“opposite”, “adjacent” and “hypotenuse” (see diagram below). 


opposite 


For future reference, we shall assume, without proof, the result known as “Pythagoras’ 
Theorem” . This states that the square of the length of the hypotenuse is equal to the sum 
of the squares of the lengths of the other two sides. 

DEFINITIONS 



(a) The “sine” of the angle 9 , denoted by sin 9 , is defined by 


sin 9 


opposite 
hypotenuse ’ 


(b) The “cosine” of the angle 9 , denoted by cos 9 , is defined by 

. adjacent 

cos 9 = ; 

hypotenuse 


(c) The “tangent” of the angle 9 , denoted by tan#, is defined by 

opposite 


tan# = 


adjacent 
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Notes: 

(i) The traditional aid to remembering the above definitions is the abbreviation 

S.O.H.C.A.H.T.O.A. 

(ii) The definitions of sin 9 , cos 9 and tan 9 can be extended to angles of any size by regarding 
the end-points of the hypotenuse, with length h, to be, respectively, the origin and the point 
(x, y ) in a cartesian system of reference. 



For any values of x and y, positive, negative or zero, the three basic trigonometric functions 
are defined in general by the formulae 

. a _ V 
sm 9 = — ; 
h 


q- x 
cos 9 = — ; 

h 


tan 9 = — 
x 


sin 9 
cos 9 


Clearly these reduce to the original definitions in the case when 9 is a positive acute angle. 
Trigonometric functions can also be called “trigonometric ratios”. 

(iii) It is useful to indicate diagramatically which of the three basic trigonometric functions 
have positive values in the various quadrants. 




S ine 



A 11 





T an 



C osine 


(iv) Three other trigonometric functions are sometimes used and are defined as the reciprocals 
of the three basic functions as follows: 


“Secant” 


sec 9 


1 

cos 9' 


“Cosecant” 


cosec# = 

sm# 
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“Cotangent” 


cot # 


1 

tan 9 


(v) The values of the functions sin#, cos 9 and tan# for the particlnar angles 30°, 45° and 
60° are easily obtained without calculator from the following diagrams: 



The diagrams show that 

(a) sin45° = (b) cos45° = (c) tan45° = 1; 

(d) sin 30° = |; (e) cos 30° — ^ (f) tan 30° = 

(g) sin 60° = 7^; (h) cos 60° = (i) tan 60° = y/3. 

3.1.4 EXERCISES 

1. Express each of the following angles as a multiple of n 
(a) 65°; (b) 105°; (c) 72°; (d) 252°; 

(e) 20°; (f) -160°; (g) 9°; (h) 279°. 

2. On a circle of radius 24cms., find the length of arc which subtends an angle at the centre 
of 

(a) | radians.; (b) ^ radians.; 

(c) 75°; (d) 130°. 

3. A wheel is turning at the rate of 48 revolutions per minute. Express this angular speed 
in 

(a) revolutions per second; (b) radians per minute; (c) radians per second. 
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4. A wheel, 4 metres in diameter, is rotating at 80 revolutions per minute. Determine the 
distance, in metres, travelled in one second by a point on the rim. 

5. A chord AB of a circle, radius 5cms., subtends a right-angle at the centre of the circle. 
Calculate, correct to two places of decimals, the areas of the two segments into which 
AB divides the circle. 

6. If tan 6 is positive and cos 6 — — what is the value of sin# ? 

7. Determine the length of the chord of a circle, radius 20cms., subtending an angle of 
150° at the centre. 

8. A ladder leans against the side of a vertical building with its foot 4 metres from the 
building. If the ladder is inclined at 70° to the ground, how far from the ground is the 
top of the ladder and how long is the ladder ? 

3.1.5 ANSWERS TO EXERCISES 

1. (a) (b) g; (c) f ; (d) g; (e) |; (f) -f ; (g) (h) . 

2. (a) 16 cms.; (b) ^ cms.; (c) 107T cms.; (d) ^ cms. 

3. (a) | revs, per sec.; (b) 96n rads, per min.; (c) g rads, per sec. 

4. ly 1 metres. 

5. 7.13 square cms. and 71.41 square cms. 

6. sin 9 = — f . 

7. The chord has a length of 38.6cms. approximately. 

8. The top of ladder is 11 metres from the ground and the length of the ladder is 11.7 
metres. 
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UNIT 3.2 - TRIGONOMETRY 2. 

GRAPHS OF TRIGONOMETRIC FUNCTIONS 

3.2.1 GRAPHS OF ELEMENTARY TRIGONOMETRIC FUNCTIONS 

The following diagrams illustrate the graphs of the basic trigonometric functions sin#, cos # 
and tan#, 



The graph illustrates that 

sin(# + 27r) = sin # 

and we say that sin# is a “periodic function with period 27t”. 

Other numbers which can act as a period are ±2n7T where n is any integer; but 2n itself is 
the smallest positive period and, as such, is called the “primitive period” or sometimes 
the “wavelength”. 


We may also observe that 


sin(— #) = — sin # 


which makes sin# what is called an “odd function”. 



The graph illustrates that 


cos(# + 2ir) = cos# 
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and so cos #, like sin#, is a periodic function with primitive period 2n 


We may also observe that 


cos( — #) = cos # 


which makes cos 6 what is called an “even function”. 



This time, the graph illustrates that 

tan(# + 7r) = tan# 

which implies that tan# is a periodic function with primitive period n. 
We may also observe that 

tan(— #) = — tan# 

which makes tan# an “odd function”. 

3.2.2 GRAPHS OF MORE GENERAL TRIGONOMETRIC 
FUNCTIONS 


In scientific work, it is possible to encounter functions of the form 
Asin(u;# + a) and Acos(ca# + a) 


where u and a are constants. 

We may sketch their graphs by using the information in the previous examples 1. and 2. 

EXAMPLES 

1. Sketch the graph of 

y = 5 cos(# — 7 r). 


Solution 

The important observations to make first are that 


2 





(a) the graph will have the same shape as the basic cosine wave but will lie between 
y = — 5 and y — 5 instead of between y = —1 and y — 1; we say that the graph has an 

“amplitude” of 5. 

(b) the graph will cross the 0-axis at the points for which 


a , * , 3?r , 5 tt 

0 — 71 = it — , it , it , 

2 ’ 2 ’ 2 ’ 


that is 


0 


37T 

Y’ 


7T 7T 37T 57T 

Y 2’ Y’ Y’ 


(c) The y-axis must be placed between the smallest negative intersection with the 
0-axis and the smallest positive intersection with the 0 - axis (in proportion to their 
values). In this case, the y-axis must be placed half way between 0 = — | and 0 = |. 



Of course, in this example, from earlier trigonometry results, we could have noticed 
that 

5 cos(0 — 7r) = — 5 cos 0 

so that graph consists of an “upsidedown” cosine wave with an amplitude of 5. However, 
not all examples can be solved in this way. 

2. Sketch the graph of 

y = 3 sin(20 + 1). 


Solution 

This time, the graph will have the same shape as the basic sine wave, but will have an 
amplitude of 3. It will cross the 0-axis at the points for which 

20 + 1 = 0, ±7T, ±2vr, ±3vr, ±4vr, 

and by solving for 0 in each case, we obtain 

0 = ... - 6.78, -5.21, -3.64, -2.07, -0.5, 1.07, 2.64, 4.21, 5.78... 

Hence, the y-axis must be placed between 0 = —0.5 and 0 = 1.07 but at about one 
third of the way from 0 = —0.5 
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UNIT 3.3 - TRIGONOMETRY 

APPROXIMATIONS AND INVERSE FUNCTIONS 

3.3.1 APPROXIMATIONS FOR TRIGONOMETRIC FUNCTIONS 

Three standard approximations for the functions sin 9, cos 9 and tan 6 respectively can be 
obtained from a set of results taken from the applications of Calculus. These are stated 
without proof as follows: 


sin 6 

cos 9 - 
ta n9 


e ~J\ + v 


9 2 9 4 

1 “ 2! + 4! ~~ 
„ 9 3 2 9 5 

: ° + J + l5 


9J_ 

7! 

d 6 

6 ! 

+ . 


These results apply only if 9 is in radians but, if 9 is small enough for 9 2 and higher 
powers of 9 to be neglected, we conclude that 

sin 9 ~ 9, 

cos 9 ~ 1, 
tan 9 ~ 9. 


Better approximations are obtainable if more terms of the infinite series are used. 


EXAMPLE 

Approximate the function 

5 + 2 cos 9 — 7 sin 9 


to a quartic polynomial in 9. 

Solution 

Using terms of the appropriate series up to and including the fourth power of d, we deduce 
that 


9 4 


9 3 


5 + 2 cos 9 - 7sin 9 ~ 5 + 2 — 9 l + - 79 + 7— 


12 


6 


1 

12 


9 4 + U9 3 - 129 2 - 849 + 84 . 
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3.3.2 INVERSE TRIGONOMETRIC FUNCTIONS 


It is frequently necessary to determine possible angles for which the value of their sine, cosine 
or tangent is already specified. This is carried out using inverse trigonometric functions 
defined as follows: 


(a) The symbol 


Sin 1 x 


denotes any angle whose sine value is the number x. It is necessary that — 1 < x < 1 since 
the sine of an angle is always in this range. 


(b) The symbol 


Cos 1 x 


denotes any angle whose cosine value is the number x. Again, — 1 < x < 1. 


(c) The symbol 


Ta n l x 


denotes any angle whose tangent value is x. This time, x may be any value because the 
tangent function covers the range from — oo to oo. 


We note that because of the A 11 , S ine , T angent , C osine diagram, (see Unit 3.1), 
there will be two basic values of an inverse function from two different quadrants. But either 
of these two values may be increased or decreased by a whole multiple of 360° (27 t) to yield 
other acceptable answers and hence an infinite number of possible answers. 


EXAMPLES 


1. Evaluate Sin 1 (|). 

Solution 

Sin _1 (|) = 30° ± n360° or 150° ± n360°. 

2. Evaluate Tan -1 (a/3). 

Solution 

Tan" 1 (a/3) = 60° ± n360° or 240° ± n360°. 

This result is in fact better written in the combined form 
Tan- 1 (a/3) = 60° ± nl80° 

THat is, angles in opposite quadrants have the same tangent. 

Another Type of Question 

3. Obtain all of the solutions to the equation 

cos 3a; = —0.432 

which lie in the interval —180° < x < 180°. 

Solution 

This type of question is of a slightly different nature since we are asked for a specified 
selection of values rather than the general solution of the equation. 
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We require that 3a; be any one of the angles (within an interval —540° < 3x < 540°) 
whose cosine is equal to —0.432. Using a calculator, the simplest angle which satisfies 
this condition is 115.59°; but the complete set is 

±115.59° ±244.41° ±475.59° 

Thus, on dividing by 3, the possibilities for x are 

±38.5° ±81.5° ±158.5° 


Note: The graphs of inverse trigonometric functions are discussed fully in Unit 10.6, but 
we include them here for the sake of completeness 





Of all the possible values obtained for an inverse trigonometric function, one particular one 
is called the “Principal Value”, ft is the unique value which lies in a specified range 
described below, the explanation of which is best dealt with in connection with differential 
calculus. 

To indicate such a principal value, we use the lower-case initial letter of each inverse function. 

(a) 6 = sin _1 a; lies in the range — | < 9 < |. 

(b) 6 = cos _1 a; lies in the range 0 < 6 < n. 
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(c) 9 = tan 1 x lies in the range — | < 9 < 

EXAMPLES 

1. Evaluate sin _1 (|). 

Solution 

sin _1 (|) = 30° or f . 

2. Evaluate tan _1 (— y^)- 

Solution 

tan _1 (— \/3) = —60° or — |. 

3. Write down a formula for u in terms of v in the case when 

v — 5 cos(l — 7 u). 


Solution 

Dividing by 5 gives 

- = cos(l — 7 u). 
5 

Taking the inverse cosine gives 


Cos 1 ( - ) = 1 — 7u. 


Subtracting 1 from both sides gives 


Cos 1 ( - ) — 1 = —7 u. 


Dividing both sides by —7 gives 


u = — - 


Cos 


-i 


3.3.3 EXERCISES 

1. If powers of 6 higher then three can be neglected, find an approximation for the function 

6 sin 6 + 2 cos 6 + 10 tan 6 
in the form of a polynomial in 9. 

2. If powers of 6 higher than five can be neglected, find an approximation for the function 

2 sin 9 — 9 cos 9 

in the form of a polynomial in 9. 
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3. If powers of 9 higher than two can be neglected, show that the function 

9 sin 9 
1 — cos 9 

is approximately equal to 2. 

4. Write down the principal values of the following: 

(a) Sin _1 l; 

(b) Sin -1 (-|); 

(c) Cos-q-f); 

(d) Tan -1 5; 

(e) Tan _1 (— \/3); 

( f ) Cos^(-^). 

5. Solve the following equations for x in the interval 0 < x < 360°: 

(a) 

tana; = 2.46 

(b) 

cos a; = 0.241 

(c) 

sin a: = —0.786 

(d) 

tana; = —1.42 

(e) 

cos a; = —0.3478 

(f) 

sin a; = 0.987 


Give your answers correct to one decimal place. 

6. Solve the following equations for the range given, stating your final answers in degrees 
correct to one decimal place: 

(a) sin 2a; = —0.346 for 0 < x < 360°; 

(b) tan 3a; = 1.86 for 0 < x < 180°; 

(c) cos 2a; = —0.57 for —180° < x < 180°; 

(d) cos 5a; = 0.21 for 0 < x < 45°; 

(e) sin 4a; = 0.78 for 0 < x < 180°. 

7. Write down a formula for u in terms of v for the following: 

(a) v = sin at; 

(b) v = cos 2 u; 

(c) v = tan(w + 1). 


5 



8. If x is positive, show diagramatically that 


(a) 

(b) 


sin l x ± cos l x = 


7T 
2 ’ 


sin l x = cos 1 ' 


1 — x 2 . 


3.3.4 ANSWERS TO EXERCISES 

1. *f-0 2 ±160±2. 

9 Q i 03 6,5 

& + T “ 40- 

3. Substitute approximations for sin 6 and cos 6. 

4. (a) f; (b) -f; (c) f ; (d) 1.373; (e) -f; (f) f . 

5. (a) 67.9° or 247.9°; 

(b) 76.1° or 283.9°; 

(c) 231.8° or 308.2°; 

(d) 125.2° or 305.32°; 

(e) 110.4° or 249.6°; 

(f) 80.8° or 99.2° 

6. (a) 100.1°, 169.9°, 280.1°, 349.9° 

(b) 20.6°, 80.6°, 140.6° 

(c) ±62.4°, ±117.6° 

(d) 15.6° 

(e) 32.2°, 102.8°, 122.2° 

7. (a) u = Sin _1 r;; (b) u = |Cos -1 w; (c) u = Tan -1 v — 1. 

8. A suitable diagram is 
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UNIT 3.4 - TRIGONOMETRY 4 
SOLUTION OF TRIANGLES 


3.4.1 INTRODUCTION 

The “solution of a triangle” is defined to mean the complete set of data relating to the 
lengths of its three sides and the values of its three interior angles. It can be shown that 
these angles always add np to 180°. 

If a sufficient amount of information is provided about some of this data, then it is usually 
possible to determine the remaining data. 

We shall use a standardised type of diagram for an arbitrary triangle whose “vertices” (i.e. 
corners) are A,B and C and whose sides have lengths a, b and c. It is as follows: 


C 



The angles at A,B and C will be denoted by A, B and C. 

3.4.2 RIGHT-ANGLED TRIANGLES 

Right-angled triangles are easier to solve than the more general kinds of triangle because all 
we need to use are the relationships between the lengths of the sides and the trigonometric 
ratios sine, cosine and tangent. An example will serve to illustrate the technique: 

EXAMPLE 

From the top of a vertical pylon, 15 meters high, a guide cable is to be secured into the 
(horizontal) ground at a distance of 7 meters from the base of the pylon. 

What will be the length of the cable and what will be its inclination (in degrees) to the 
horizontal ? 
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Solution 



From Pythagoras’ Theorem, the length of the cable will be 

V7 2 + 15 2 ~ 16.55m. 


15m 


The angle of inclination to the horizontal will be 9, where 


tan# 


15 

T' 


Hence, 6 ~ 65°. 

3.4.3 THE SINE AND COSINE RULES 

Two powerful tools for the solution of triangles in general may be stated in relation to the 
earlier diagram as follows: 

(a) The Sine Rule 

a b c 

sin A sin B sin C 

(b) The Cosine Rule 


a 2 = b 2 + c 2 — 2 be cos A; 
b 2 = c 2 + a 2 — 2 ca cos R; 
c 2 = a 2 + b 2 — 2 ab cos C. 

Clearly, the last two of these are variations of the first. 
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We also observe that, whenever the angle on the right-hand-side is a right-angle, the Cosine 
Rule reduces to Pythagoras’ Theorem. 

The Proof of the Sine Rule 

In the diagram encountered earlier, suppose we draw the perpendicular ( of length h) from 
the vertex C onto the side AB. 



sin B sin A 

Clearly, the remainder of the Sine Rule can be obtained by considering the perpendicular 
drawn from a different vertex. 


The Proof of the Cosine Rule 

Using the same diagram as for the Sine Rule, we can assume that the side AB has lengths 
x and c — x either side of the foot of the perpendicular drawn from C. Hence 


and, at the same time, 


h 2 = b 2 - x 2 


Ji 2 = a 2 — (c — x) 


Expanding and equating the two expressions for h 2 , we obtain 

T 2 2 2 2,0 2 

o— x = a — c+ lex — x 

that is 


a 2 = 6 2 + c 2 — 2 xc. 


But x = b cos A, and so 


a 2 = b 2 + c 2 — 2 be cos A. 
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EXAMPLES 


1. Solve the triangle ABC in the case when A = 20°, B = 30° and c = 10cm. 

Solution 

Firstly, the angle C = 130° since the interior angles must add up to 180°. 
Thus, by the Sine Rule, we have 


That is, 


These give the results 


a b 10 

sin 20° sin 30° sin 130° 


a b 10 
0.342 “ (15 “ 0.766 


a = 


b = 


10 x 0.342 

0.766 
10 x 0.5 
0.766 


= 4.47cm 


== 6.53cm 


2. Solve the triangle ABC in the case when b = 9cm, c = 5cm and A = 70°. 

Solution 

In this case, the information prevents us from using the Sine Rule immediately, but the 
Cosine Rule can be applied as follows: 


giving 


Hence 


a 2 = 25 + 81 - 90 cos 70° 

a 2 = 106 - 30.782 = 75.218 
a ~ 8.673cm ~ 8.67cm 


Now we can use the Sine Rule to complete the solution 

8.673 9 _ 5 

sin 70° sin B sin C 


Thus, 


sinR = 


9 x sin 70° 9x0.940 


8.673 


8.673 


~ 0.975 


This suggests that B ~ 77.19° in which case C — 180° — 70° — 77.19° ~ 32.81° but, 
for the moment, we must also allow the possibility that B ~ 102.81° which would give 
C ~ 7.19° 


However, we can show that the alternative solution is unacceptable because it is not 
consistent with the whole of the Sine Rule statement for this example. Thus the only 
solution is the one for which 


a ~ 8.67cm, B ~ 77.19°, C ~ 32.81° 


Note: It is possible to encounter examples for which more than one solution does exist. 
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3.4.4 EXERCISES 


Solve the triangle ABC in the following cases: 

1. c = 25cm, A = 35°, B = 68°. 

2. c = 23cm, a = 30cm, C = 40°. 

3. b = 4cm, c = 5cm, A = 60°. 

4. a = 21cm, b = 23cm, c = 16cm. 

3.4.5 ANSWERS TO EXERCISES 

1. a ~ 14.72cm, b ~ 23.79cm, C ~ 77°. 

2. A ~ 56.97°, B ~ 83.03°, b = 35.52cm; 

OR 

A ~ 123.03°, B ~ 16.97°, b ~ 10.44cm. 

3. a ~ 4.58cm, B ~ 49.11°, C ~ 70.89°. 

4. A ~ 62.13°, B ~ 75.52°, C ~ 42.35°. 
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UNIT 3.5 - TRIGONOMETRY 5 

TRIGONOMETRIC IDENTITIES AND WAVE FORMS 
3.5.1 TRIGONOMETRIC IDENTITIES 

The standard trigonometric functions can be shown to satisfy a certain group of relationships 
for any value of the angle 9. They are called “trigonometric identities” . 

ILLUSTRATION 

Prove that 

cos 2 d + sin 2 d = 1. 

Proof: 


The following diagram was first encountered in Unit 3.1 



From the diagram, 

But, by Pythagoras’ Theorem, 
In other words, 

That is, 


cos 9 = — and sin 9 = y. 
h h 


x 2 + y 2 = h 2 


x\ 2 ( y\ 2 

h) + \h) = ' 


cos 2 d + sin 2 d = 1. 


It is also worth noting various consequences of this identity: 

(a) cos 2 9 = 1 — sin 2 d; (rearrangement). 

(b) sin 2 d = 1 — cos 2 9\ (rearrangement). 

(c) sec 2 9 = 1 + tan 2 d; (divide by cos 2 6*). 

(d) cosec 2 9 = 1 + cot 2 ^; (divide by sin 2 d). 

Other Trigonometric Identities in common use will not be proved here, but they are listed 
for reference. However, a booklet of Mathematical Formulae should be obtained. 
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sec 9 = 


cosec# = 


cot 9 = 


cos 9 sin 9 tan 9 

cos 2 9 + sin 2 0 = 1 , 1 + tan 2 0 = sec 2 0 1 + cot 2 0 = 

sin(v4 + B) = sin A cos B + cos A sin B 
sin(A — B) = sin A cos B — cos A sin B 
cos(A + B) = cos A cos B — sin A sin B 
cos(A — B) = cos A cos B + sin A sin B 

, , tan A + tan B 

tanM + B ) = - - 

1 — tan A tan B 

, . tan A — tan B 

tan (A - B) = 


1 + tan A tan B 
sin 2 A = 2 sin A cos A 

cos2A = cos 2 A — sin 2 A = 1 — 2sin 2 A = 2cos 2 A 

2 tan A 


tan 2 A 


1 — tan 2 A 


sin A = 2 sin - A cos - A 
2 2 

11 11 
cos A = cos 2 -A — sin 2 -A = 1 — 2sin 2 -A = 2cos 2 - 
2 2 2 2 

2 tan l A 

tan A = 

1 - tan 2 ^ 


sin A + sin B = 2 sin 
sin A — sin B = 2 cos 
cos A + cos B = 2 cos 
cos A — cos B = —2 sin 


A + fi 
2 

A + 5 
2 

A + fi 
2 

A + 5 


cos 


Sill 


cos 


sin 


2 

A-B 

2 

A-B 

2 

A-B 


sin A cos B = - [sin (A + B) + sin(A — 5)] 

cos A sin B = ^ [sin (.4 + £>) — sin(4 — £?)] 

cos A cos B = ^ [cos(4 + 5) + cos(4 — B )] 

sin A sin B = ^ [cos(4 — B) — cos(4 + 5)] 

sin 34 = 3 sin A — 4sin 3 v4 
cos 3 A = 4cos 3 4 — 3 cos A 


cosec 2 9 


- 1 


A- 1 
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EXAMPLES 


1. Show that ^ 

sin 2 2a; = -(1 — cos Ax). 

Solution 

From the standard trigonometric identities, we have 

cos 4a; = 1 — 2sin 2 2a; 

on replacing A by 2a;. 

Rearranging this new identity, gives the required result. 

2. Show that 

sin ^ + 2 ) ^ COS 

Solution 

The left hand side can be expanded as 

n 7T , 7T 

sin 6 cos — b cos u sm — ; 

2 2 ’ 

and the result follows, because cos | = 0 and sin | = 1. 

3. Simplify the expression 

sin 2a + sin 3a 
cos 2a — cos 3a 

Solution 

Using separate trigonometric identities in the numerator and denominator, the expres- 
sion becomes 



4. Express 2 sin 3a; cos 7x as the difference of two sines. 

Solution 

2 sin 3a; cos 7x = sin(3a; + 7x) + sin(3a; — 7x). 

Hence, 

2 sin 3a; cos 7x = sin 10a; — sin 4a;. 
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3.5.2 AMPLITUDE, WAVE-LENGTH, FREQUENCY AND PHASE ANGLE 


In the scientific applications of Mathematics, importance is attached to trigonometric func- 
tions of the form 

A sin (out + a) and A cos(out + a), 
where A, ou and a are constants and t is usually a time variable. 


It is useful to note, from trigonometric identities, that the expanded forms of the above two 
functions are given by 


and 


A sm(out + a) = A sin out cos a + A cos out sin a 
A cos(u it + a) = A cos out cos a — A sin out sin a. 


(a) The Amplitude 

In view of the fact that the sine and the cosine of any angle always lies within the closed in- 
terval from —1 to +1 inclusive, the constant, A, represents the maximum value (numerically) 
which can be attained by each of the above trigonometric functions. 

A is called the “amplitude” of each of the functions. 

(b) The Wave Length (Or Period) 

If the value, t, increases or decreases by a whole multiple of — , then the value, ( out + a), 
increases or decreases by a whole multiple of 27 r; and, hence, the functions remain unchanged 
in value. 

A graph, against t, of either A sin (out + a) or A cos(out + a) would be repeated in shape at 
regular intervals of length — . 

The repeated shape of the graph is called the “wave profile” and — is called the 
“wave-length”, or “period” of each of the functions. 

(c) The Frequency 

If t is indeed a time variable, then the wave length (or period) represents the time taken to 
complete a single wave-profile. Consequently, the number of wave-profiles completed in one 
unit of time is given by | L . 

j- is called the “frequency” of each of the functions. 

Note: 

The constant ou itself is called the “angular frequency”; it represents the change in the 
quantity ( out + a) for every unit of change in the value of t. 

(d) The Phase Angle 

The constant, a, affects the starting value, at t — 0, of the trigonometric functions 
A sin (cut + a) and Acos(out + a). Each of these is said to be “out of phase”, by an amount, 
a, with the trigonometric functions A sin cut and A cos out respectively. 
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a is called the “phase angle” of each of the two original trigonometric functions; but it can 
take infinitely many values differing only by a whole multiple of 360° (if working in degrees) 
or 2n (if working in radians ). 

EXAMPLES 

1. Express sin t + \/3cos t in the form Asin(t + a), with a in degrees, and hence solve the 
equation, 


sin t + V3 cos t — 1 


for t in the range 0° < t < 360°. 

Solution 

We require that 

sin t + \/?> cos t = A sin t cos a + A cos t sin a 

Hence, 

A cosa = 1 and A sin a = \/3, 

which gives A 2 = 4 (using cos 2 a + sin 2 a = 1) and also tana = y/3. 
Thus, 


A = 2 and a = 60° (principal value). 

To solve the given equation, we may now use 

2sin(f + 60°) = 1, 

so that 

t + 60° = Sin -1 ^ = 30° + k360° or 150° + A;360°, 

where k may be any integer. 

For the range 0° < t < 360°, we conclude that 

t = 330° or 90°. 

2. Determine the amplitude and phase-angle which will express the trigonometric function 
a sin out + b cos out in the form A sin (cat + a). 

Apply the result to the expression 3 sin 5f — 4 cos 5 1 stating a in degrees, correct to one 
decimal place, and lying in the interval from —180° to 180°. 

Solution 

We require that 

Asin(c<A + a) = a sin cat + b cos cut; 
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and, hence, from trigonometric identities, 

A sin a = b and A cos a = a. 

Squaring each of these and adding the results together gives 

A 2 = a 2 + b 2 that is A = \J a 2 + b 2 . 


Also, 


which gives 


A sin a b 

A cos a a ’ 

-l 

a = tan 

a 


but the particular angle chosen must ensure that sin a = and cos a = j have the 
correct sign. 

Applying the results to the expression 3 sin 5t — 4 cos 5 1, we have 

A = \/3 2 + 4 2 


a = tan 1 ( — - 


But sin a (= — is negative and cos a (= 0 is positive so that a may be taken as an 
angle between zero and —90°; that is a = —53.1°. 


We conclude that 


3. Solve the equation 


3sin5t — 4cos5t = 5sin(5t — 53.1°). 


4 sin 2t + 3 cos 2t = 1 


for t in the interval from —180° to 180°. 

Solution 

Expressing the left hand side of the equation in the form Asin(2f + a), we require 

Q 

A _ . /A 2 i Q2 _ K 


A = v4 2 + 3 2 = 5 and a = tan 

4 

Also since (= |) is positive and cosce (^= |^) is positive so that a may be taken as an 
angle in the interval from zero to 90°. 

Hence, a = 36.87° and the equation to be solved becomes 

5 sin(2t + 36.87°) = 1. 

Its solutions are obtained by making t the “subject” of the equation to give 

t = - [sin -1 ^ — 36.87°] . 


The possible values of Sin _1 | are 11.53° + A;360° and 168.46° + /c360°, where k may be 
any integer. But, to give values of t which are numerically less than 180°, we may use 
only k = 0 and k — 1 in the first of these and k = 0 and k — —1 in the second. 

The results obtained are 

t = -12.67°, 65.80°, 167.33° and - 114.21° 
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3.5.3 EXERCISES 


1. Simplify the following expressions: 


(a) 

(b) 


(1 + cosa;)(l — cos a;); 


(1 + sin a;) 2 — 2 sina;(l + sin a;). 


2. Show that 


cos 




sin 0 


3. Express 2 sin Ax sin 5a: as the difference of two cosines. 

4. Use the table of trigonometric identities to show that 


(a) 

(b) 


sin 5a; + sin x 

= tan 3a;; 

cos 5a; + cos x 


1 — cos 2x 
1 + cos 2x 


= tan 2 a;; 


(d) 


tan x. tan 2a; + 2 = tan 2x. cot x; 


cot (a; + y) 


cot x. cot y — 1 
cot y + cot x 


5. Solve the following equations by writing the trigonometric expression on the left-hand- 
side in the form suggested, being careful to see whether the phase angle is required in 
degrees or radians and ensuring that your final answers are in the range given: 


(a) cos t + 7 sin 7 = 5, 0° < t < 360°, (transposed to the form Acos(t — a), with a in 

degrees. 

(b) y^cos t — sin t = 1, 0° < t < 360°, (transposed to the form Acos(t + a), with a 

in degrees. 

(c) 2 sin t — cos t = 1, 0 < t < 2n, (transposed to the form 7Lsin(f — a), with a in 

radians. 

(d) 3 sin t — 4cost = 0.6, 0° < t < 360°, (transposed to the form 7lsin(f — a), with a 

in degrees. 


6. Determine the amplitude and phase-angle which will express the trigonometric function 
acosut + b sin ujt in the form 7tcos(u;t + a). 

Apply the result to the expression 4 cos 5 1 — 4\/3 sin 5 1 stating a in degrees and lying in 
the interval from —180° to 180°. 
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7. Solve the equation 


2 cos 3t + 5 sin 3t = 4 

for t in the interval from zero to 360°, expressing t in decimals correct to two decimal 
places. 

3.5.4 ANSWERS TO EXERCISES 

1. (a) sin 2 a;; (b) cos 2 a;. 

2. Use the cos(A - B) formula to expand left hand side. 

3. cos a; — cos 9a:. 

4. (a) Use the formulae for sinA + sinB and cosA + cosB; 

(b) Use the formulae for cos 2a; to make the l’s cancel; 

(c) Both sides are identically equal to — jtypy; 

(d) Invert the formula for tan (a: + y). 

5. (a) 36.9°, 126.9°; 

(b) 19.5°, 270°; 

(c) 0, 3.14; 

(d) 226.24° 

6 . 

A = V a 2 + b 2 , and a = tan -1 



7 . 


4 cos 5 1 — 4\/3sin5f = 8cos(5f + 60°). 


^008(3 1 - 68.20°) = 4 or V29sin(3f + 21.80°) = 4 


give 


t = 8.72°, 36.74°, 156.74° and 276.74' 
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UNIT 4.1 - HYPERBOLIC FUNCTIONS 
DEFINITIONS, GRAPHS AND IDENTITIES 

4.1.1 INTRODUCTION 

In this section, we introduce a new group of mathematical functions, based on the functions 

e x and e~ x 

whose properties resemble, very closely, those of the standard trigonometric functions. But, 
whereas trigonometric functions can be related to the geometry of a circle (and are sometimes 
called the “circular functions”), it can be shown that the new group of functions are related 
to the geometry of a hyperbola (see unit 5.7). Because of this, they are called “hyperbolic 
functions” . 

4.1.2 DEFINITIONS 

(a) Hyperbolic Cosine 

The hyperbolic cosine of a number, x, is denoted by cosh x and is defined by 

g X j g X 

cosh x = . 


Note: 

The name of the function is pronounced “cosh” . 

(b) Hyperbolic Sine 

The hyperbolic sine of a number, x, is denoted by sinh x and is defined by 

g 

sinh x = . 

2 


Note: 

The name of the function is pronounced “shine” 

(c) Hyperbolic Tangent 

The hyperbolic tangent of a number, x, is denoted by tanh x and is defined by 

sinh x 

tanh x = . 

cosh x 


Notes: 

(i) The name of the function is pronounced “than” . 
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(ii) In terms of exponentials, it is easily shown that 


e x — e x e 2x — 1 

farm x = = — . 

e x + e~ x e 2x + 1 


(d) Other Hyperbolic Functions 


Other, less commonly used, hyperbolic functions are defined as follows: 


(i) Hyperbolic secant, pronounced “shek”, is defined by 


sechr = . 

cosh x 


(ii) Hyperbolic cosecant, pronounced “coshek” is defined by 


cosechr = — - — . 

smhr 


(iii) Hyperbolic cotangent, pronounced “coth” is defined by 


1 cosh x 

cotlix = — = — - — . 

tanh x smh x 
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4.1.3 GRAPHS OF HYPERBOLIC FUNCTIONS 


It is useful to see the graphs of the functions cosh x , sinh x and tanh x drawn with reference 
to the same set of axes. It can be shown that they are as follows: 



Note: 

We observe that the graph of cosh re exists only for y greater than or equal to 1; and that 
graph of tanh x exists only for y lying between —1 and +1. The graph of sinh x, however, 
covers the whole range of x and y values from — oo to +oo. 

4.1.4 HYPERBOLIC IDENTITIES 

It is possible to show that, to every identity obeyed by trigonometric functions, there is a 
corresponding identity obeyed by hyperbolic functions though, in some cases, the comparison 
is more direct than in other cases. 

ILLUSTRATIONS 

1 . 


e x = cosh re + sinh x. 


Proof 

This follows directly from the definitions of cosh x and sinh x. 

2 . 


e x = cosh x — sinh x. 


Proof 

Again, this follows from the definitions of cosh x and sinh x. 
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3 . 


cosh 2 a; — sinh 2 .x' = 1. 


Proof 

This follows if we multiply together the results of the previous two illustrations since 
e x .e~ x = 1 and (cosh a: + sinh x) (cosh x — sinh x) = cosh 2 a; — sinh 2 x. 

Notes: 

(i) Dividing throughout by cosh 2 a; gives the identity 

1 — tanh 2 x = sech 2 ax 

(ii) Dividing throughout by sinh 2 x gives the identity 

coth 2 .r — 1 = cosech 2 ax 


4 . 


sinh (a: + y) = sinh x cosh y + cosh x sinh y. 


Proof: 

The right hand side may be expressed in the form 


e x — e x e y + e y e x + e x e y — e y 
x • x + x • x • 


which expands out to 


e {x+y) _j_ e {x-y) _ p(-x+y) _ p(-x-y) p(x+y) _ p (x-y ) 4. p (-x+y) _ p (-x-y) 


+ 


e v*-y) + e t 


4 4 

and this simplifies to 

2 e ( x +y) — 2e ( '~ x ~ y ' ) ^ x+y ) — e ~( x +y) 


= sinh(x + y). 


cosh(x + y) = cosh x cosh y + sinh x sinh y. 


Proof 

The proof is similar to the previous illustration. 


4 



6 . 


tanh(x + y) 


tanh x + tanh y 
1 — tanh x tanh y 


Proof 

The proof again is similar to that in Illustration No. 4. 

4.1.5 OSBORN’S RULE 

Many other results, similar to those previously encountered in the standard list of trigono- 
metric identities can be proved in the same way as for Illustration No. 4 above; that is, we 
substitute the definitions of the appropriate hyperbolic functions. 

However, if we merely wish to write down a hyperbolic identity without proving it, we may 
use the following observation due to Osborn: 

Starting with any trigonometric identity, change cos to cosh and sin to sinh. Then, if the 
trigonometric identity contains (or implies) two sine functions multiplied together, change 
the sign in front of the relevant term from + to — or vice versa. 

ILLUSTRATIONS 

1 . 


cos 2 a; + sin 2 a: = 1, 

which leads to the hyperbolic identity 

cosh 2 a; — sinh 2 .r = 1 

since the trigonometric identity contains two sine functions multiplied together. 

2 . 


sin(x — y) = sin x cos y — cos x sin y, 

which leads to the hyperbolic identity 

sinh (a; — y) = sinh x cosh y — cosh x sinh y 

in which no changes of sign are required. 

3. 

sec 2 x = 1 + tan 2 a:, 
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which leads to the hyperbolic identity 

sech 2 a; = 1 — tanh 2 x 

since tan 2 a; in the trigonometric identity implies that two sine functions are multiplied 
together; that is, 


2 sura: 

tan x = — . 

cos 2 # 


4.1.6 EXERCISES 

1. If 

L = 2Csmh^, 

determine the value of L when H = 63 and C = 50 

2. If 

2 , „ r , 6.3d 

v = 1.8Ltanh , 

L 

determine the value of v when d = 40 and L = 315. 

3. Use Osborn’s Rule to write down hyperbolic identities for 
(a) 


(b) 


sinh 2A; 


cosh 2 A. 


4. Use the results of the previous question to simplify the expression 

1 + sinh 2A + cosh 2 A 
1 — sinh 2 A — cosh 2 A 


5. Use Osborn’s rule to write down the hyperbolic identity which corresponds to the 
trigonometric identity 

2 sin x sin y = cos(a; — y) — cos(rr + y) 


and prove your result. 

6. If 

show that 


a = c cosh x and b = c sinh X, 
(a + b) 2 e~ 2x = a 2 — b 2 = c 2 . 
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4.1.7 ANSWERS TO EXERCISES 

1. 67.25 

2. 19.40 

3. (a) 

sinh 2 A = 2 sinh A cosh A; 

(b) 

cosh 2 A = cosh 2 A + sinh 2 A = 2cosh 2 A — 1 = 1 + 2sinh 2 A 

4. 

— cothA. 

5. 

—2 sinh x sinh y = cosh(A — y) — cosh(a; + ?/). 
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UNIT 4.2 - HYPERBOLIC FUNCTIONS 2 
INVERSE HYPERBOLIC FUNCTIONS 

4.2.1 - INTRODUCTION 


The three basic inverse hyperbolic functions are Cosh 1 x, Sinh l x and Tanh l x. 
It may be shown that they are given by the following formulae: 

(a) 


(b) 


Cosh l x = ± ln(x + Vx 2 — 1); 


Sinh l x = ln(x + \J x 2 + 1); 


(c) 


Tanh L x 



1 + x 
1 — x 


Notes: 

(i) The positive value of CoslV 1 ^ is called the ‘principal value” and is denoted by coslV 1 ^ 
(using a lower-case c). 

(ii) Sinh - l x and Tanh - 1 x have only one value but, for uniformity, we customarily denote 
them by sinlc'x and tanh - [ x. 

4.2.2 THE PROOFS OF THE STANDARD FORMULAE 
(a) Inverse Hyperbolic Cosine 

If we let y = Cosh -1 ;r, then 


, e y + e~ y 
x = coshy = 

Hence, 


2x = e y + e y . 


1 



On rearrangement, 


(e y f - 2xe y + 1 = 0, 


which is a quadratic equation in e y having solutions, from the quadratic formula, given by 


e y = 


2x ± a/ 4r 2 — 4 


= X ± Vx 2 — 1. 


Taking natural logarithms of both sides gives 


y 


= ln(x ± Vx 2 — 1). 


However, the two values x + \Jx 2 — 1 and x — \/x 2 — 1 are reciprocals of each other, since 
their product is the value, 1; and so 


y — ± ln(x + \Jx 2 — 1). 


(b) Inverse Hyperbolic Sine 

If we let y = Sinl+'x, then 


x = sinh y 


e y - e~ y 


2 


Hence, 


2x — e y — e y , 


or 


( e v f - 2xe y -1 = 0, 


which is a quadratic equation in e y having solutions, from the quadratic formula, given by 


2 



e 


y — 


2x ± y/Ax 2 + 4 
2 


= x ± \/a: 2 + 1. 


However, a; — \/a; 2 + 1 has a negative value and cannot, therefore, be equated to a power of 
e, which is positive. Hence, this part of the expression for e y must be ignored. 

Taking natural logarithms of both sides gives 


y = ln(x + V x 2 + 1). 


(c) Inverse Hyperbolic Tangent 

If we let y = Tanh _1 x, then 


, e y - e~ y 

x = tanh y = 

e y + e~ y 


e 2y - 1 
e 2y + 1 ‘ 


Hence, 


x 



— e 2y 


- 1 , 


giving 


e 2 » = 1±£. 

1 — x 


Taking natural logarithms of both sides, 


V = 



1 + x 

1 — X 


Note: 

The graphs of inverse hyperbolic functions are discussed fully in Unit 10.7, but we include 
them here for the sake of completeness: 


3 




(c) y = Tanh 1 x 



4.2.3 EXERCISES 

1. Use the standard formulae to evaluate (a) sinh _1 7 and (b) cosh _1 9. 

2. Express cosh 2x and sinh 2x in terms of exponentials and hence solve, for x, the equation 


2 cosh 2x — sinh 2x = 2. 


3. Obtain a formula which equates cosech l x to the natural logarithm of an expression in 
x, distinguishing between the two cases x > 0 and x < 0. 

4. If t — tanh(x/2), prove that 

(a) 

sinh x = 


and 

(b) 


cosh x 


l + t 2 
1 -t 2 ‘ 


Hence solve, for x, the equation 


7 sinh x + 20 cosh x = 24. 
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4.2.4 ANSWERS TO EXERCISES 


1. (a) 

In (7 + V49 + 1) ~ 2.644; 

(b) 

ln(9 + y/Sl - 1) ~ 2.887 

2 . 

(e 2 *) 2 - 4e 2x + 3 = 0, 

which gives e 2x = 1 or 3 and hence x = 0 or ^ In 3 ~ 0.549. 
3. If x > 0, then 


cosech 1 x 


In 


1 + y/l + X 2 

X 


If x < 0, then 


cosech x x 


1 - y/\ + X 2 

In 


x 


4. Use 


sinh x = 


2tanh(x/2) 

sech 2 (x/2) 


and 


cosh x = 


sech 2 (x/2) 


This gives t = or t = A and hence x ~ —1.099 or 0.368 which agrees with the 
solution obtained using exponentials. 
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UNIT 5.1 - GEOMETRY 1 


CO-ORDINATES, DISTANCE AND GRADIENT 
5.1.1 CO-ORDINATES 
(a) Cartesian Co-ordinates 

The position of a point, P, in a plane may be specified completely if we know its perpendic- 
ular distances from two chosen fixed straight lines, where we distinguish between positive 
distances on one side of each line and negative distances on the other side of each line. 

It is not essential that the two chosen fixed lines should be at right-angles to each other, but 
we usually take them to be so for the sake of convenience. 

Consider the following diagram: 


V 

{x,y) 


0 


X 


The horizontal directed line, Oa;, is called the “x-axis” and distances to the right 
origin (point O) are taken as positive. 

The vertical directed line, O y, is called the “y-axis” and distances above the origin 
O) are taken as positive. 

The notation (x, y) denotes a point whose perpendicular distances from O y and Ox are x 
and y respectively, these being called the “cartesian co-ordinates” of the point. 

(b) Polar Co-ordinates 

An alternative method of fixing the position of a point P in a plane is to choose first a point, 
O, called the “pole” and directed line , Ox, emanating from the pole in one direction only 
and called the “initial line”. 


of the 
(point 
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Consider the following diagram: 


(r, 9 ) 


The position of P is determined by its distance r from the pole and the angle, 9 which the 
line OP makes with the initial line, measuring this angle positively in a counter-clockwise 
sense or negatively in a clockwise sense from the initial line. The notation (r, 9) denotes the 

“polar co-ordinates” of the point. 

5.1.2 THE RELATIONSHIP BETWEEN POLAR AND CARTESIAN CO-ORDINATES 

It is convenient to superimpose the diagram for Polar Co-ordinates onto the diagram for 
Cartesian Co-ordinates as follows: 


(r, 9) 


The trigonometry of the combined diagram shows that 

(a) x = r. cos 9 and y = r. sin 9 ; 

(b) r 2 = x 2 + y 2 and 9 = tan -1 ^. 

EXAMPLES 

1. Express the equation 

2x + 3y = 1 

in polar co-ordinates. 

Solution 

Substituting for x and y separately, we obtain 

2 r cos 9 + 3r sin 9 — 1 

That is 

1 

2 cos 9 + 3 sin 9 
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2. Express the equation 


r = sin 9 


in cartesian co-ordinates. 

Solution 

We could try substituting for r and 9 separately, but it is easier, in this case, to rewrite 
the equation as 

r 2 = r sin 9 


which gives 


x 2 + y 2 = y 


5.1.3 THE DISTANCE BETWEEN TWO POINTS 

Given two points (xi,yi) and (^ 2 , 1 / 2 ), the quantity | x 2 — X\ | is called the “horizontal sep- 
aration” of the two points and the quantity | y-i ~ U\ | is called the “vertical separation” 
of the two points, assuming, of course, that the rr-axis is horizontal. 

The expressions for the horizontal and vertical separations remain valid even when one or 
more of the co-ordinates is negative. For example, the horizontal separation of the points 
(5, 7) and (—3, 2) is given by j — 3 — 5 |= 8 which agrees with the fact that the two points 
are on opposite sides of the y- axis. 

The actual distance between (xi,y±) and (x 2 , 2 / 2 ) is easily calculated from Pythagoras’ The- 
orem, using the horizontal and vertical separations of the points. 



In the diagram, 
That is, 
giving 


PQ 2 = PR 2 + RQ 2 . 
d 2 = (x 2 -x 1 ) 2 + (y 2 -y 1 ) 2 , 
d = \j{x 2 ~ aq) 2 + (|/2 - yi) 2 - 


Note: 

We do not need to include the modulus signs of the horizontal and vertical separations 
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because we are squaring them and therefore, any negative signs will disappear. For the same 
reason, it does not matter which way round the points are labelled. 

EXAMPLE 

Calculate the distance, d, between the two points (5, —3) and (—11, —7). 

Solution 

Using the formula, we obtain 

d= ^(5 + 11)2 + (-3 + 7) 2 . 

That is, 

d = V256 + 16 = V272 = 16.5 


5.1.4 GRADIENT 


The gradient of the straight-line segment, PQ, joining two points P and Q in a plane is 
defined to be the tangent of the angle which PQ makes with the positive x-direction. 


In practice, when the co-ordinates of the two points are P(aq, yi) and Q(x 2 , 1 / 2 ), the gradient, 
m, is given by either 

V2 - Ui 

m = 

x 2 - Xi 
or 

y 1 - V2 

m = , 

xi - x 2 


both giving the same result. 


This is not quite the same as the ratio of the horizontal and vertical separations since we 
distinguish between positive gradient and negative gradient. 

EXAMPLE 

Determine the gradient of the straight-line segment joining the two points (8, —13) and 
(—2, 5) and hence calculate the angle which the segment makes with the positive x-direction. 

Solution 


m = 


5 + 13 
- 2-8 


-13-5 
8 + 2 


- 1.8 


Hence, the angle, 6, which the segment makes with the positive x-direction is given by 

tand = —1.8 


Thus, 


6 = tan x (— 1.8) ~ 119°. 


5.1.5 EXERCISES 

1. A square, side d, has vertices 0,A,B,C (labelled counter-clockwise) where O is the pole 
of a system of polar co-ordinates. Determine the polar co-ordinates of A,B and C when 
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(a) OA is the initial line; 

(b) OB is the initial line. 

2. Express the following cartesian equations in polar co-ordinates: 

(a) 

x 2 + y 2 -2y = 0; 

(b) 

y 2 = 4a(a — x). 

3. Express the following polar equations in cartesian co-ordinates: 

(a) 

r 2 sin 29 = 3; 

(b) 

r — 1 + cos 6. 


4. Determine the length of the line segment joining the following pairs of points given in 
cartesian co-ordinates: 

(a) (0, 0) and (3, 4); 

(b) (-2,-3) and (1,1); 

(c) (-4,-6) and (-1.-2); 

(d) (2,4) and (—3,16); 

(e) (-1,3) and (11,-2). 

5. Determine the gradient of the straight-line segment joining the two points (—5, —0.5) 
and (4.5, —1). 


5.1.6 ANSWERS TO EXERCISES 

1. (a) A(d, 0), B(d,\/2, |), C(d, f); 

(b) A(d,-f),B(dV^,0),C(d,f). 

2. (a) r = 2 sin#; 

(b) r 2 sin 2 # = 4a (a — r cos 9). 

3. (a) xy = §; 

(b) x 4 + y 4 + 2 x 2 y 2 — 2x 3 — 2 xy 2 — y 2 = 0. 

4. (a) 5; (b) 5; (c) 5; (d) 13; (e) 13. 
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UNIT 5.2 - GEOMETRY 2 


THE STRAIGHT LINE 

5.2.1 PREAMBLE 

It is not possible to give a satisfactory diagramatic definition of a straight line since the 
attempt is likely to assume a knowledge of linear measurement which, itself, depends on 
the concept of a straight line. For example, it is no use defining a straight line as “the 
shortest path between two points” since the word “shortest” assumes a knowledge of linear 
measurement. 

In fact, the straight line is defined algebraically as follows: 

DEFINITION 

A straight line is a set of points, (x,y), satisfying an equation of the form 

ax + by + c = 0 

where a, b and c are constants. This equation is called a “linear equation” and the symbol 
(x, y ) itself, rather than a dot on the page, represents an arbitrary point of the line. 

5.2.2 STANDARD EQUATIONS OF A STRAIGHT LINE 
(a) Having a given gradient and passing through the origin 



Let the gradient be m; then, from the diagram, all points (x, y ) on the straight line (but no 
others) satisfy the relationship, 


x 


That is, 


y = mx 


which is the equation of this straight line. 
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EXAMPLE 

Determine, in degrees, the angle, 9, which the straight line, 

V3y = x , 


makes with the positive x-direction. 

Solution 

The gradient of the straight line is given by 

tan 6 


1 

vr 


Hence, 

6 = tan -1 ^^ = 30°. 

Vz 


(b) Having a given gradient, and a given intercept on the vertical axis 



Let the gradient be m and let the intercept be c; then, in this case we can imagine that the 
relationship between x and y in the previous section is altered only by adding the number c 
to all of the y co-ordinates. Hence the equation of the straight line is 


y = mx + c. 

EXAMPLE 

Determine the gradient, m, and intercept c on the y - axis of the straight line whose equation 

is 

7x — by - 3 = 0. 


Solution 


On rearranging the equation, we have 

7 3 

y= 5 X -5- 


Hence, 


m = 


7 

5 


2 




and 


c = — 


3 

5 


This straight line will intersect the y-axis below the origin because the intercept is negative. 

(c) Having a given gradient and passing through a given point 

Let the gradient be m and let the given point be (aq,yi). Then, 

y = rnx + c, 


where 


y i = mx i + c. 


Hence, on subtracting the second of these from the first, we obtain 


V — Vi = m(x-x i). 


EXAMPLE 

Determine the equation of the straight line having gradient | and passing through the point 
(-7,2). 


Solution 

From the formula, 

That is 


y- 2 = ^(* + 7 ). 

8y — 16 = 3a: + 21, 


giving 


8y = 3a: + 37. 


(d) Passing through two given points 

Let the two given points be (aq,yi) and (. x 2 ,y 2 )■ Then, the gradient is given by 


m = 


V2 - Vi 

X 2 -Xi 


Hence, from the previous section, the equation of the straight line is 

2/2 — 2/i . v 

y-y i = (a - ®i); 

x 2 - X\ 

but this is more usually written 


y-y\ _ x-x x 
y 2 -yi x 2 — Xi 


Note: 

The same result is obtained no matter which way round the given points are taken as (xi, y\) 
and (x 2 ,y 2 ). 
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EXAMPLE 

Determine the equation of the straight line joining the two points (—5, 3) and (2, —7), stating 
the values of its gradient and its intercept on the y-axis. 

Solution (Method 1). 


giving 


That is, 


y — 3 x + 5 
-7-3 ““ 2 + 5’ 

7(y — 3) = —10 (a; + 5). 


lOx + 7y + 29 = 0. 


Solution (Method 2). 

giving 
That is, 
as before. 


y + 7 x — 2 


3 + 7 -5-2’ 

— 7(y + 7) = 10(x — 2). 

10a; + 7y + 29 = 0, 


By rewriting the equation of the line as 

10 29 

y 7 7 

we see that the gradient is — y and the intercept on the y-axis is — y. 

(e) The parametric equations of a straight line 


In the previous section, the common value of the two fractions 

2/ — 2/i , x-xi 

and 

V2 ~y i x 2 - x i 

is called the “parameter” of the point (x, y) and is usually denoted by t. 


By equating each fraction separately to t, we obtain 

x = x 1 + (x 2 - x^t and y = Vl + (y 2 - y^t. 


These are called the “parametric equations” of the straight line while (a+yi) and (x 2 ,y 2 ) 
are known as the “base points” of the parametric representation of the line. 

Notes: 

(i) In the above parametric representation, (xi,yi) has parameter t — 0 and (. x 2 ,y 2 ) has 
parameter t — 1. 

(ii) Other parametric representations of the same line can be found by using the given base 
points in the opposite order, or by using a different pair of points on the line as base points. 
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EXAMPLES 


1. Use parametric equations to find two other points on the line joining (3, —6) and (—1, 4). 

Solution 

One possible parametric representation of the line is 

x = 3 — At y = — 6 + lOt. 

To find another two points, we simply substitute any two values of t other than 0 or 1. 
For example, with t — 2 and t — 3, 


x = —5, y — 14 and x = —9, y = 24. 


2 . 


A pair of suitable points is therefore (—5, 14) and (—9, 24). 

The co-ordinates, x and y, of a moving particle are given, at time t, by the equations 

x = 3 — At and y = 5 + 2t 

Determine the gradient of the straight line along which the particle moves. 

Solution 

Eliminating t, we have 

x — 3 y — 5 
-4 “ 2 

That is, 


2(x-3) = -4(j/-5), 


giving 



26 

T' 


Hence, the gradient of the line is 


2 

4 


1 

2 ' 


5.2.3 PERPENDICULAR STRAIGHT LINES 

The perpendicularity of two straight lines is not dependent on either their length or their 
precise position in the plane. Hence, without loss of generality, we may consider two straight 
line segments of equal length passing through the origin. The following diagram indicates 
appropriate co-ordinates and angles to demonstrate perpendicularity: 
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In the diagram, the gradient of OP = ^ and the gradient of OQ = 

Hence the product of the gradients is equal to —1 or, in other words, each gradient 
is minus the reciprocal of the other gradient. 

EXAMPLE 

Determine the equation of the straight line which passes through the point (—2, 6) and is 
perpendicular to the straight line, 


3x + by + 11 = 0. 


Solution 

The gradient of the given line is — | which implies that the gradient of a perpendicular line 
is |. Hence, the required line has equation 


giving 
That is, 


y- 6 = t ^ + 2 ), 

3y-18 = 5x + 10. 
3y = bx + 28. 


5.2.4 CHANGE OF ORIGIN 

Given a cartesian system of reference with axes Oa; and O y, it may sometimes be convenient 
to consider a new set of axes O'X parallel to Oa; and O'Y parallel to O y with new origin at 
O' whose co-ordinates are ( h , k ) referred to the original set of axes. 
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In the above diagram, everything is drawn in the first quadrant, but the relationships ob- 
tained between the old and new co-ordinates will apply in all situations. They are 

X = x — h and Y = y — k 

or 

x — X + h and y — Y + k. 


EXAMPLE 

Given the straight line, 


y = 3x + 11, 


determine its equation referred to new axes with new origin at the point (—2,5). 


Solution 


Using 
we obtain 
That is, 


x = X — 2 and y — Y + 5, 
Y + 5 = 3(X - 2) + 11. 

Y = 3X, 


which is a straight line through the new origin with gradient 3. 

Note: 

If we had spotted that the point (—2,5) was on the original line, the new line would be 
bound to pass through the new origin; and its gradient would not alter in the change of 
origin. 
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5.2.5 EXERCISES 


1. Determine the equations of the following straight lines: 

(a) having gradient 4 and intercept —7 on the y-axis; 

(b) having gradient | and passing through the point (—2,5); 

(c) passing through the two points (1,6) and (5,9). 

2. Determine the equation of the straight line passing through the point (1, —5) which is 
perpendicular to the straight line whose cartesian equation is 

x + 2y = 3. 

3. Given the straight line 

y = Ax + 2, 

referred to axes Oa; and O y, determine its equation referred to new axes O'X and O'Y 
with new origin at the point where x = 7 and y = —3 (assuming that Oa: is parallel to 
O'X and O y is parallel to O'Y). 

4. Use the parametric equations of the straight line joining the two points (—3,4) and 
(7,-1) in order to find its point of intersection with the straight line whose cartesian 
equation is 

x — y + 4 = 0. 


5.2.6 ANSWERS TO EXERCISES 


1. (a) 

(b) 


2 . 


y = 4a; — 7; 


3 y = x + 17; 


Ay = 3x + 21. 


3. 


y — 2a; — 7. 


4 . 


Y = AX + 33. 


x = — 3 + lOt y — A — 5f, 
giving the point of intersection (at t — |) as (—1,3). 
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UNIT 5.3 - GEOMETRY 3 


STRAIGHT LINE LAWS 
5.3.1 INTRODUCTION 

In practical work, the theory of an experiment may show that two variables, x and y, are 
connected by a straight line equation (or “straight line law” ) of the form 

y = rnx + c. 


In order to estimate the values of m and c, we could use the experimental data to plot a 
graph of y against x and obtain the “best straight line” passing through (or near) the 
plotted points to average out any experimental errors. Points which are obviously out of 
character with the rest are usually ignored. 



It would seem logical, having obtained the best straight line, to measure the gradient, m, 
and the intercept, c, on the y- axis. However, this not always the wisest way of proceeding 
and should be avoided in general. The reasons for this are as follows: 

(i) Economical use of graph paper may make it impossible to read the intercept, since this 
part of the graph may be “off the page” . 

(ii) The use of symbols other than x or y in scientific work may leave doubts as to which 
is the equivalent of the y-axis and which is the equivalent of the x-axis. Consequently, the 
gradient may be incorrectly calculated from the graph. 

The safest way of finding m and c is to take two sets of readings, (xi,yi) and (£2,2/2), from 
the best straight line drawn then solve the simultaneous linear equations 
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yi = mx i + c, 
y 2 = mx 2 + c. 


It is a good idea if the two points chosen are as far apart as possible, since this will reduce 
errors in calculation due to the use of small quantities. 

5.3.2 LAWS REDUCIBLE TO LINEAR FORM 

Other experimental laws which are not linear can sometimes be reduced to linear form by 
using the experimental data to plot variables other than x or y, but related to them. 

EXAMPLES 

1 . y = ax 2 + b. 

Method 

We let X = x 2 , so that y = aX + b and hence we may obtain a straight line by plotting 
y against X. 

2. y = ax 2 + bx. 

Method 

Here, we need to consider the equation in the equivalent form - = ax + b so that, by 
letting Y — giving Y = ax + b, a straight line will be obtained if we plot Y against 
x. 

Note: 

If one of the sets of readings taken in the experiment happens to be 
(x,y) = (0,0), we must ignore it in this example. 

3. xy = ax + b. 

Method 

Two alternatives are available here as follows: 

(a) Letting xy = Y, giving Y = ax + b, we could plot a graph of Y against x. 

(b) Writing the equation as y = a + -, we could let - — X, giving y = a + bX, and plot 
a graph of y against X. 
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4. y = ax b . 

Method 

This kind of law brings in the properties of logarithms since, if we take logarithms of 
both sides (base 10 will do here), we obtain 

logm V = log 10 a + 61 °gio^ 

Letting log 10 y = Y and log 10 x = X, we have 

Y = log 10 a + bX, 

so that a straight line will be obtained by plotting Y against X. 

5. y = ab x . 

Method 

Here again, logarithms may be used to give 

logio V = l°gio a + x l°Sio b 

Letting log 10 y = Y, we have 

Y = log 10 a + x log 10 6, 

which will give a straight line if we plot Y against x. 

6. y = ae bx . 

Method 

In this case, it makes sense to take natural logarithms of both sides to give 

log e y = log e a + bx, 

which may also be written 

In y = In a + bx 

Hence, letting In y = Y, we can obtain a straight line by plotting a graph of Y against 
x. 


Note: 

In all six of the above examples, it is even more important not to try to read off the gradient 
and the intercept from the graph drawn. As before, we should take two sets of readings for 
x (or X) and y (or Y), substitute them in the straight-line form of the equation and solve 
two simultaneous linear equations for the constants required. 
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5.3.3 THE USE OF LOGARITHMIC GRAPH PAPER 


In Examples 4,5 and 6 in the previous section, it can be very tedious looking up on a 
calculator the logarithms of large sets of numbers. We may use, instead, a special kind of 
graph paper on which there is printed a logarithmic scale (see Unit 1.4) along one or both 
of the axis directions. 

0.1 0.2 0.3 0.4 1 2 3 4 10 

Effectively, the logarithmic scale has already looked up the logarithms of the numbers as- 
signed to it provided these numbers are allocated to each “cycle” of the scale in successive 
powers of 10. 

Data which includes numbers spread over several different successive powers of ten will need 
graph paper which has at least that number of cycles in the appropriate axis direction. 

For example, the numbers 0.03, 0.09, 0.17, 0.33, 1.82, 4.65, 12, 16, 20, 50 will need four 
cycles on the logarithmic scale. 

Accepting these restrictions, which make logarithmic graph paper less economical to use 
than ordinary graph paper, all we need to do is to plot the actual values of the variables 
whose logarithms we would otherwise have needed to look up. This will give the straight 
line graph from which we take the usual two sets of readings; these are then substituted into 
the form of the experimental equation which occurs immediately after taking logarithms of 
both sides. 

It will not matter which base of logarithms is being used since logarithms to two differ- 
ent bases are proportional to each other anyway. The logarithmic graph paper does not, 
therefore, specify a base. 

EXAMPLES 

1. y — ax b . 

Method 

(i) Taking logarithms (base 10), log 10 y = log 10 a + 61og 10 a:. 

(ii) Plot a graph of y against x, both on logarithmic scales. 

(iii) Estimate the position of the “best straight line” . 

(iv) Read off from the graph two sets of co-ordinates, (xi,y i) and (^ 2 , 2 / 2 ), as far apart 
as possible. 
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(v) Solve for a and b the simultaneous equations 


logical = l°g 10 a + 61og 10 xi, 

loglO V 2 = logio a + b lo glO ^2 • 


If it is possible to choose readings which are powers of 10, so much the better, but this 
is not essential. 

2. y = ab x . 

Method 

(i) Taking logarithms (base 10), log 10 y = log 10 a + x log 10 b. 

(ii) Plot a graph of y against x with y on a logarithmic scale and x on a linear scale. 

(iii) Estimate the position of the best straight line. 

(iv) Read off from the graph two sets of co-ordinates, (£1,2/1) and (£2,2/2), as f ar apart 
as possible. 

(v) Solve for a and b the simultaneous equations 

logio 2/i = logio a + x i l°gio ^ 
logio V2 = logio a + £2 logio b - 


If it is possible to choose zero for the X\ value, so much the better, but this is not 
essential. 

3. y = ae bx . 

Method 

(i) Taking natural logarithms, In y = In a + bx. 

(ii) Plot a graph of y against x with y on a logarithmic scale and x on a linear scale. 

(iii) Estimate the position of the best straight line. 

(iv) Read off two sets of co-ordinates, (£1,2/1) and (£2,2/2), as far apart as possible. 

(v) Solve for a and b the simultaneous equations 

In 2/1 = hi a + bx 1 , 

In 2/2 = In a + bx 2 . 


If it possible to choose zero for the £1 value, so much the better, but this is not essential. 
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5.3.5 EXERCISES 


In these exercises, use logarithmic graph paper where possible. 

1. The following values of x and y can be represented approximately by the law y = a+bx 2 : 


X 

0 

2 

4 

6 

8 

10 

y 

7.76 

11.8 

24.4 

43.6 

71.2 

107.0 


Use a straight line graph to find approximately the values of a and b. 
2. The following values of x and y are assumed to follow the law y = ab x : 


X 

0.2 

0.4 

0.6 

0.8 

1.4 

1.8 

y 

0.508 

0.645 

0.819 

1.040 

2.130 

3.420 


Use a straight line graph to find approximately the values of a and b. 

3. The following values of x and y are assumed to follow the law y = ae kx : 


X 

0.2 

0.5 

0.7 

1.1 

1.3 

y 

1.223 

1.430 

1.571 

1.921 

2.127 


Use a straight line graph to find approximately the values of a and k. 

4. The table below gives the pressure, P , and the volume, V, of a certain quantity of steam 
at maximum density: 


P 

12.27 

17.62 

24.92 

34.77 

47.87 

65.06 

V 

3,390 

2,406 

1,732 

1,264 

934.6 

699.0 


Assuming that PV n = C , use a straight line graph to find approximately the values of 
n and C. 

5. The coefficient of self induction, L, of a coil, and the number of turns, N, of wire are 
related by the formula L = aN b , where a and b are constants. 

For the following pairs of observed values, use a straight line graph to calculate approx- 
imate values of a and b: 


N 

25 

35 

50 

75 

150 

200 

250 

L 

1.09 

2.21 

5.72 

9.60 

44.3 

76.0 

156.0 


6. Measurements taken, when a certain gas undergoes compression, give the following 
values of pressure, p, and temperature, T : 


P 

10 

15 

20 

25 

35 

50 

T 

270 

289 

303 

315 

333 

353 


Assuming a law of the form T = ap n , use a straight line graph to calculate approximately 
the values of a and n. Hence estimate the value of T when p = 32. 
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5.3.6 ANSWERS TO EXERCISES 

The following answers are approximate; check only that the order of your results are correct. 
Any slight variations in the position of your straight line could affect the result considerably. 

1. a ~ 8.0, b ~ 0.99 

2. a ~ 0.4, b ~ 3.3 

3. a ~ 1.1, k ~ 0.5 

4. n ~ 1.06, C' ~ 65887 

5. a ~ 1.38 x 10~ 3 , b ~ 2.08 

6. a ~ 183.95, n ~ 0.17 
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UNIT 5.4 - GEOMETRY 4 


ELEMENTARY LINEAR PROGRAMMING 
5.4.1 FEASIBLE REGIONS 

(i) The equation, y = mx + c, of a straight line is satisfied only by points which lie on the 
line. But it is useful to investigate the conditions under which a point with co-ordinates 
(x, y ) may lie on one side of the line or the other. 

(ii) For example, the inequality y < mx + c is satisfied by points which lie below the line 
and the inequality y > mx + c is satisfied by points which lie above the line. 

(iii) Linear inequalities of the form Ax + By + C < 0 or Ax + By + C > 0 may be interpretted 
in the same way by converting, if necessary, to one of the forms in (ii). 

(iv) Weak inequalities of the form Ax + By + C < 0 or Ax + By + C > 0 include the points 
which lie on the line itself as well as those lying on one side of it. 

(v) Several simultaneous linear inequalties may be used to determine a region of the xy- 
plane throughout which all of the inequalities are satisfied. The region is called the “feasible 
region” . 

EXAMPLES 

1. Determine the feasible region for the simultaneous inequalities 

x > 0, y > 0, x + y < 20, and 3x + 2y < 48 


Solution 

We require the points of the first quadrant which lie on or below the straight line 
y = 20 — x and on or below the straight line y — — |x + 16. 

The feasible region is shown as the shaded area in the following diagram: 
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2. Determine the feasible region for the following simultaneous inequalities: 

0 < x < 50, 0 < y < 30, x + y < 80, x + y > 60 


Solution 

We require the points which lie on or to the left of the straight line x = 50, on or below 
the straight line y = 30, on or below the straight line y = 80 — x and on or above the 
straight line y = 60 — x. 

The feasible region is shown as the shaded area in the following diagram: 
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5.4.2 OBJECTIVE FUNCTIONS 


An important application of the feasible region discussed in the previous section is that 
of maximising (or minimising) a linear function of the form px + qy subject to a set of 
simultaneous linear inequalities. Such a function is known as an “objective function” 

Essentially, it is required that a straight line with gradient — | is moved across the appropriate 
feasible region until it reaches the highest possible point of that region for a maximum value 
or the lowest possible point for a minimum value. This will imply that the straight line 
px + qy = r is such that r is the optimum value required. 

However, for convenience, it may be shown that the optimum value of the objective function 
always occurs at one of the corners of the feasible region so that we simply evaluate it at 
each corner and choose the maximum (or minimum) value. 

EXAMPLES 

1. A farmer wishes to buy a number of cows and sheep. Cows cost £18 each and sheep 
cost £12 each. 

The farmer has accommodation for not more than 20 animals, and cannot afford to 
pay more than £288. 

If he can reasonably expect to make a profit of £11 per cow and £9 per sheep, how 
many of each should he buy in order to make his total profit as large as possible ? 

Solution 

Suppose he needs to buy x cows and y sheep; then, his profit is the objective function 
P = llx + 9 y. 

Also, 


x > 0, y > 0, x + y < 20, and 18a; + 12 y < 288 or 3x + 2y < 48. 

Thus, we require to maximize P = llx + 9y in the feasible region for the first example 
of the previous section. 

The corners of the region are the points (0,0), (16,0), (0,20) and (8,12), the last of 
these being the point of intersection of the two straight lines x+y = 20 and 3x+2y = 48. 

The maximum value occurs at the point (8, 12) and is equal to 88 + 108 = 196. Hence, 
the farmer should buy 8 cows and 12 sheep. 
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2. A cement manufacturer has two depots, D| and D 2 , which contain current stocks of 80 
tons and 20 tons of cement respectively. 

Two customers Ci and C 2 place orders for 50 and 30 tons respectively. 

The transport cost is £1 per ton, per mile and the distances, in miles, between Di, D 2 , 
Ci and C 2 are given by the following table: 



Ci 

c 2 

Di 

40 

30 

d 2 

10 

20 


From which depots should the orders be dispatched in order to minimise the transport 
costs ? 

Solution 

Suppose that Di distributes x tons to Ci and y tons to C 2 ; then D 2 must distribute 
50 — x tons to Ci and 30 — y tons to C 2 . 

All quantities are positive and the following inequalties must be satisfied: 
x < 50, y < 30, x + y < 80, 80 — {x + y) < 20 or x + y > 60 

The total transport costs, T, are made up of 40a:, 30a/, 10(50 — x) and 20(30 — y). 
That is, 


T = 30a: + 10a/ + 1100, 

and this is the objective function to be minimised. 

From the diagram in the second example of the previous section, we need to evaluate 
the objective function at the points (30,30), (50,30) and (50, 10). 

The minimum occurs, in fact, at the point (30, 30) so that Di should send 30 tons to 
Ci and 30 tons to C 2 while D 2 should send 20 tons to Ci but 0 tons to C 2 . 
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5.4.3 EXERCISES 


1. Sketch, on separate diagrams, the regions of the xy - plane which correspond to the 
following inequalities (assuming that x > 0 and y > 0): 

(a) 


x + y < 6; 


(b) 


x + y > 4; 


3x + y > 6] 


(d) 


x + 3y > 6. 


2. Sketch the feasible region for which all the inequalities in question 1 are satisfied. 

3. Maximise the objective function 5a; + 7 y subject to the simultaneous linear inequalities 


x > 0, y > 0, 3x + 2y > 6 and x + y < 4. 

4. A mine manager has contracts to supply, weekly, 

100 tons of grade 1 coal, 

700 tons of grade 2 coal, 

2000 tons of grade 3 coal, 

4500 tons of grade 4 coal. 

Two seams, A and B, are being worked at a cost of £4000 and £10,000, respectively, 
per shift, and the yield, in tons per shift, from each seam is given by the following 
table: 



Grade 1 

Grade 2 

Grade 3 

Grade 4 

A 

200 

100 

200 

400 

B 

100 

100 

500 

1500 


How many shifts per week should each seam be worked, in order to fulfill the contracts 
most economically ? 
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5. A manufacturer employs 5 skilled and 10 semi-skilled workers to make an article in two 
qualities, standard and deluxe. 

The deluxe model requires 2 hour’s work by skilled workers; the standard model requires 
1 hour’s work by skilled workers and 3 hour’s work by semi-skilled workers. 

No worker works more than 8 hours per day and profit is £10 on the deluxe model and 
£8 on the standard model. 

How many of each type, per day, should be made in order to maximise profits ? 

5.4.4 ANSWERS TO EXERCISES 

1. (a) The region is as follows: 



6 


(b) The region is as follows 


y 


4 


© 


(c) The region is as follows 


(d) The region is as follows: 




3. The feasible region is as follows: 



The maximum value of 5x + 7 y occurs at the point (0, 4) and is equal to 28. 

4. Subject to the simultaneous inequalities 

x > 0, y > 0, 2x + y > 10, x + y > 7, 2x + 5y > 20 and 4x + 5y > 45, 
the function 2x + 5 y has minimum value 20 at any point on the line 2x + 5y = 20. 

5. Subject to the simultaneous inequalities 

x > 0, y > 0, x + 2y < 0 and 3x + 2y < 80, 
the objective function P = 8x + 10 y has maximum value 260 at the point (20, 10). 
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UNIT 5.5 - GEOMETRY 5 


CONIC SECTIONS - THE CIRCLE 

5.5.1 INTRODUCTION 

In this and the following three units, we shall investigate some of the geometry of four 
standard curves likely to be encountered in the scientific applications of Mathematics. They 
are the Circle, the Parabola, the Ellipse and the Hyperbola. 

These curves could be generated, if desired, by considering plane sections through a cone; 
and, because of this, they are often called “conic sections” or even just “conics”. We 
shall not discuss this interpretation further, but rather use a more analytical approach. 

The properties of the four standard conics to be included here will be restricted to those 
required for simple applications work and, therefore, these notes will not provide an extensive 
course on elementary co-ordinate geometry. 

Useful results from previous work which will be used in these units include the Change of 
Origin technique (Unit 5.2) and the method of Completing the Square (Unit 1.5). These 
results should be reviewed, if necessary, by the student. 

DEFINITION 

A circle is the path traced out by (or “locus” of) a point which moves at a fixed distance, 
called the “radius”, from a fixed point, called the “centre”. 

5.5.2 STANDARD EQUATIONS FOR A CIRCLE 

(a) Circle with centre at the origin and having radius a. 



Using Pythagoras’s Theorem in the diagram, the equation which is satisfied by every point 
(x, y ) on the circle, but no other points in the plane of the axes, is 


2,2 2 
x + y = a . 


This is therefore the cartesian equation of a circle with centre (0, 0) and radius a. 
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Note: 

The angle 9 in the diagram could be used as a parameter for the point (x, y ) to give the 
parametric equations 


x = a cos 6 , y = a sin 6. 

Each point on the curve has infinitely many possible parameter values, all differing by a 
multiple of 27 t; but it is usually most convenient to choose the value which lies in the interval 
—7 T < 9 < 7T. 

(b) Circle with centre ( h , k) having radius a. 



If we were to consider a temporary change of origin to the point ( h , k) with A"-axis and 
Y -axis, the circle would have equation 

X 2 + Y 2 = a 2 , 

with reference to the new axes. But, from previous work, 

X = x — h and Y = y — k. 


Hence, with reference to the original axes, the circle has equation 

(x — h ) 2 + {y — k) 2 = a 2 ; 


or, in its expanded form, 
where 


x 2 + y 2 — 2 lix — 2 ky + c = 0, 
c = h 2 + k 2 — a 2 . 
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Notes: 

(i) The parametric equations of this circle with reference to the temporary new axes through 
the point (h, k ) would be 

X = a cos 9, Y = a sin 6. 

Hence, the parametric equations of the circle with reference to the original axes are 

x = h + a cos 6 , y — k + a sin 6. 


(ii) If the equation of a circle is encountered in the form 

(x — h ) 2 + (y — k) 2 = a 2 , 

it is very easy to identify the centre, (h, k ) and the radius, a. If the equation is encountered 
in its expanded form, the best way to identify the centre and radius is to complete the 
square in the x and y terms in order to return to the first form. 

EXAMPLES 


1 . 


Determine the co-ordinates of the centre and the value of the radius of the circle whose 
equation is 

x 2 + y 2 + 4x + 6y + 4 = 0. 


Solution 

Completing the square in the x terms, 

x 2 + 4x = (x + 2) 2 — 4. 
Completing the square in the y terms, 

y 2 + Gy = (y + 3) 2 - 9. 


The equation of the circle therefore becomes 

{x + 2) 2 + (y + 3) 2 = 9. 

Hence the centre is the point (—2, —3) and the radius is 3. 

2. Determine the co-ordinates of the centre and the value of the radius of the circle whose 
equation is 

5a; 2 + 5 y 2 — 10a; + 15 y + 1 = 0. 


Solution 

Here it is best to divide throughout by the coefficient of the x 2 and y 2 terms, even if 
some of the new coefficients become fractions. We obtain 

X 

x 2 + y 2 — 2a; + 3y + - =0. 

5 

Completing the square in the x terms, 

x 2 — 2a; = (x — l) 2 — 1. 
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Completing the square in the y terms, 

y 2 + 3y = (y + 0 - 

The equation of the circle therefore becomes 

(®- !) 2 + (y + ^) = 

Hence the centre is the point (l, — and the radius 

Note: 

Not every equation of the form 

x 2 + y 2 — 2 hx — 2 ky + c = 0 

represents a circle because, for some combinations of h, k and c, the radius would not be a 
real number. In fact, 

a = Vh 2 + k 2 — c, 

which could easily turn out to be unreal. 

5.5.3 EXERCISES 


9 

4' 


61 

20 ' 


is 


% = 1.75 


1 . 

2 . 


Write down the equation of the circle with centre (4, —3) and radius 2. 

Determine the co-ordinates of the centre and the value of the radius of the circle whose 
equation is 

x 2 + y 2 — 2x + Ay — 11 = 0. 


3. Determine the co-ordinates of the centre and the value of the radius of the circle whose 
equation is 

36a; 2 + 36 y 2 - 36a; - 24 y - 131 = 0. 

4. Determine the equation of the circle passing through the point (4, —3) and having centre 
(2, 1). What is the radius of the circle and what are its parametric equations ? 

5. Use the parametric equations of the straight line joining the two points (2, 4) and —4, 2) 
to find its points of intersection with the circle whose equation is 

x 2 + y 2 + 4a; — 2y = 0. 


Hint: 

Substitute the parametric equations of the straight line into the equation of the circle 
and find two solutions for the parameter. 
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5.5.4 ANSWERS TO EXERCISES 


1. The equation of the circle is either 

(x — 4) 2 + (y + 3) 2 = 4, 
or 

x 2 + y 2 — 8x + 6y + 21 = 0. 

2. The centre is (1, —2) and the radius is 4. 

3. The centre is and radius is 2. 

4. The equation is 

(x - 2) 2 + (y - l) 2 = 20 

and the radius is \/20. 

The parametric equations are 

x = 2 + V20 cos 6, y — 1 + V20 sin 9. 


5. From 


x = 2 — 6t, and y = 4 — 2t, 


we obtain 

40t 2 - 60t + 20 = 0, 

giving t — \ and t — 1. 

The points of intersection are (—1,3) and (—4,2). 
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UNIT 5.6 - GEOMETRY 6 


CONIC SECTIONS - THE PARABOLA 
5.5.1 INTRODUCTION 

The Standard Form for the equation of a Parabola 
DEFINITION 

A parabola is the path traced out by (or “locus” of) a point, P, whose distance, SP, from a 
fixed point, S, called the “focus”, is equal to its perpendicular distance, PM, from a fixed 
line, /, called the “directrix”. 



For convenience, we may take the directrix, /, to be a vertical line - with the perpendicular 
onto it from the focus, S, being the x-axis. We could take the y-axis to be the directrix itself, 
but the equation of the parabola turns out to be simpler if we use a different line; namely 
the line parallel to the directrix passing through the mid-point of the perpendicular from the 
focus onto the directrix. This point is one of the points on the parabola so that we make 
the curve pass through the origin. 



Letting the focus be the point (a, 0) (since it lies on the x-axis) the definition of the parabola 
implies that SP = PM. That is, 



x + a. 
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Squaring both sides gives 


(x — a) 2 + y 2 = x 2 + 2 ax + a 2 , 
or 

x 2 — 2 ax + a 2 + y 2 = x 2 + 2 ax + a 2 . 


This reduces to 


y 2 = 4 ax 


and is the standard equation of a parabola with “vertex” at the origin, focus at (a, 0) and 
axis of symmetry along the x-axis. All other versions of the equation of a parabola which 
we shall consider will be based on this version. 


Notes: 

(i) If a is negative, the bowl of the parabola faces in the opposite direction towards negative 
x values. 

(ii) Any equation of the form y 2 = kx, where k is a constant, represents a parabola with 
vertex at the origin and axis of symmetry along the x-axis. Its focus will lie at the point 

it is worth noting this observation for future reference. 

(iii) The parabola y 2 = 4ax may be represented parametrically by the pair of equations 

x = at 2 , y = 2 at] 

but the parameter, t, has no significance in the diagram such as was the case for the circle. 

5.6.2 OTHER FORMS OF THE EQUATION OF A PARABOLA 
(a) Vertex at (0, 0) with focus at (0, a) 



This parabola is effectively the same as the standard one except that the roles of x and y 
have been reversed. We may assume, therefore that the curve has equation 

x 2 = Aay 

with associated parametric equations 

x = 2 at, y = at 2 
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(b) Vertex at (h, k ) with focus at (h + a, k ) 



If we were to consider a temporary change of origin to the point ( h,k ), with X-axis and 
X-axis, the parabola would have equation 

Y 2 = 4 aX. 


With reference to the original axes, therefore, the parabola has equation 

(y — k ) 2 = 4a(a; — h). 


Notes: 

(i) Such a parabola will often be encountered in the expanded form of this equation, contain- 
ing quadratic terms in y and linear terms in x. Conversion to the stated form by completing 
the square in the y terms will make it possible to identify the vertex and focus. 

(ii) With reference to the new axes with origin at the point (h, k ), the parametric equations 
of the parabola would be 

X = at 2 , Y = 2 at. 

Hence, with reference to the original axes, the parametric equations are 

x = h + at 2 , y = k + 2 at. 
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(c) Vertex at (h, k ) with focus at (h, k + a) 



If we were to consider a temporary change of origin to the point (/i, k) with X-axis and 
X-axis, the parabola would have equation 

X 2 = 4 aY. 


With reference to the original axes, therefore, the parabola has equation 

(x — h ) 2 = 4 a(y — k). 


Notes: 

(i) Such a parabola will often be encountered in the expanded form of this equation, contain- 
ing quadratic terms in x and linear terms in y. Conversion to the stated form by completing 
the square in the x terms will make it possible to identify the vertex and focus. 

(ii) With reference to the new axes with origin at the point (h, k ), the parametric equations 
of the parabola would be 

X = 2 at, Y = at 2 . 

Hence, with reference to the original axes, the parametric equations are 

x = h + 2 at, y = k + at 2 . 


EXAMPLES 

1. Give a sketch of the parabola whose cartesian equation is 

y 2 — 6y + 3x = 10, 

showing the co-ordinates of the vertex, focus and intesections with the x-axis and y-axis. 

Solution 

First, we must complete the square in the y terms obtaining 

y 2 - 6y = (y - 3) 2 - 9. 
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Hence, the equation becomes 


That is, 


or 


(y - 3) 2 - 9 + 3a; = 10. 
(y - 3) 2 = 19 - 3x, 

-*(-!)(■ 


19 \ 

t ! la:- yj • 


Thus, the vertex lies at the point ( ^, 3 ) and the focus lies at the point ( 
is, (g,3). 

The parabola intersects the x-axis where y — 0, i.e. 

3x = 10, 

giving x — y. 

The parabola intersects the y-axis where x = 0; that is, 

y 2 - 6y - 10 = 0, 

which is a quadratic equation with solutions 

6 ± V36 + 40 


V = 


= 7.36 or - 1.36 



2. Use the parametric equations of the parabola 

x 2 = 8 y 

to determine its points of intesection with the straight line 

y = x + 6. 

Solution 

The parametric equations are x = 4 1, y = 2 1 2 . 

Substituting these into the equation of the straight line, we have 

2 1 2 = 4t + 6. 


|, 3) ; that 


5 



That is, 


t 2 - 2t - 3 = 0, 
or 

(t — 3)(t + 1) = 0, 

which is a quadratic equation in t having solutions t — 3 and t — — 1. 
The points of intersection are therefore (12, 18) and (—4,2). 

5.6.3 EXERCISES 


1. For the following parabolae, determine the co-ordinates of the vertex, the focus and the 
points of intersection with the x- axis and y - axis where appropriate: 

(a) 

(y — l) 2 — 4(x — 2); 


(b) 


(x + 1) 2 = 8(j/~3); 
2x = y 2 + Ay + 6; 


(d) 


2 . 


x 2 + Ax — Ay + 6 = 0. 

Use the parametric equations of the parabola 

y 2 = 12x 

to determine its points of intersection with the straight line 

6x + 5y — 12 = 0. 


5.6.4 ANSWERS TO EXERCISES 


1. (a) Vertex (2, 1), Focus (3, 1), Intersection (f,0) with the a;-axis; 

(b) Vertex (—1,3), Focus (—1,5), Intersection (0, y) with the y- axis; 

(c) Vertex (1, —2), Focus (|, —2), Intersection (3,0) with the rr-axis; 

(d) Vertex (—2, |), Focus (—2, |), Intersection (0, |) with the y- axis. 

2. x = 3 1 2 and y = 6f give 18t 2 + 30 1 — 12 = 0 with solutions t = \ and t = —2. Hence 
the points of intersection are (|,2) and (12, —12). 
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UNIT 5.7 - GEOMETRY 7 


CONIC SECTIONS - THE ELLIPSE 
5.7.1 INTRODUCTION 

The Standard Form for the equation of an Ellipse 
DEFINITION 

The Ellipse is the path traced out by (or “locus of) a point, P, for which the distance, SP, 
from a fixed point, S, and the perpendicular distance, PM, from a fixed line, /, satisfy a 
relationship of the form 

SP = e.PM, 

where e < 1 is a constant called the “eccentricity” of the ellipse. The fixed line, /, is called 
a “directrix” of the ellipse and the fixed point, S, is called a “focus” of the ellipse. 



In fact, there are two foci and two directrices because the curve turns out to be symmetrical 
about a line parallel to l and the perpendicular line from S onto l. The diagram below 
illustrates two foci, S and S', together with two directrices, l and l'. The axes of symmetry 
are taken as the co-ordinate axes. 



It can be shown that, with this system of reference, the ellipse has equation 


x 

a 


2 

2 


+ 


b 2 


1 , 


with associated parametric equations 


x = a cos 9, y = b sin 9. 
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The curve clearly intersects the x-axis at (±a, 0) and the y - axis at (0, ±6). Whichever is the 
larger of a and b defines the length of the “semi-major axis” and whichever is the smaller 
defines the length of the “semi-minor axis” . 

For the sake of completeness, it may further be shown that the eccentricity, e, is obtainable 
from the formula 

b 2 = a 2 (l - e 2 ) 

and, having done so, the foci lie at (±ae, 0) with directrices at x — ±-. However, in these 
units, students will not normally be expected to determine the eccentricity, foci or directrices 
of an ellipse. 

5.7.2 A MORE GENERAL FORM FOR THE EQUATION OF AN ELLIPSE 

The equation of an ellipse, with centre (h, k) and axes of symmetry parallel to Ox and O y 
respectively, is easily obtainable from the standard form of equation by a temporary change 
of origin to the point (h, k ). We obtain 

(x-h) 2 (y ~k) 2 _ 
a 2 6 2 

with associated parametric equations 

x = h + a cos 6, y = k + b sin 9. 

Ellipses will usually be encountered in the expanded form of the above cartesian equation 
and it will be necessary to complete the square in both the x terms and the y terms in order 
to locate the centre of the ellipse. The expanded form will be similar in appearance to that 
of a circle but the coefficients of x 2 and y 2 , though both of the same sign, will not be equal 
to each other. 

EXAMPLE 

Determine the co-ordinates of the centre and the lengths of the semi-axes of the ellipse whose 
equation is 

3x 2 + y 2 + 12x — 2y + 1 = 0. 

Solution 

Completing the square in the x terms gives 

3x 2 + 12x = 3 [x 2 + Ax] =3 [(x + 2) 2 - 4] = 3(x + 2) 2 - 12. 

Completing the square in the y terms gives 

y 2 - 2y = (y - l) 2 - 1. 

Hence, the equation of the ellipse becomes 

3(x + 2) 2 + (y - l) 2 = 12. 


2 



That is, 


(x + 2) 2 (y-1) 2 

4 12 


The centre is thus located at the point (—2, 1) and the semi-axes have lengths a 
b = a/12- 



2 and 


5.7.3 EXERCISES 


1. For each of the following ellipses, determine the co-ordinates of the centre and give a 
sketch of the curve: 


x 2 + 4 y 2 — 4x — 8y + 4 = 0; 


(b) 


(c) 


x 2 + 4y 2 + 16y + 12 = 0; 


x 2 + Ay 2 + 6x — 8y + 9 = 0. 

2. Determine the lengths of the semi-axes of the ellipse whose equation is 

9a; 2 + 25 y 2 = 225 

and write down also a pair of parametric equations for this ellipse. 
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5.7.4 ANSWERS TO EXERCISES 


1. (a) Centre (2, 1) with a — 2 and 6 = 1. 



(b) Centre (0, —2) with a = 2 and 6 = 1. 

y 

0 - 



(c) Centre (—3, 1) with a — 2 and 6 = 1. 



2. a = 5 and 6 = 3, giving the parametric equations x = 5 cos#, y 
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UNIT 5.8 - GEOMETRY 8 


CONIC SECTIONS - THE HYPERBOLA 

5.8.1 INTRODUCTION 

DEFINITION 

The hyperbola is the path traced out by (or “locus” of) a point, P, for which the distance, 
SP, from a fixed point, S, and the perpendicular distance, PM, from a fixed line, /, satisfy a 
relationship of the form 

SP = e.PM, 

where e > 1 is a constant called the “eccentricity” of the hyperbola. The fixed line, /, 
is called a “directrix” of the hyperbola and the fixed point, S, is called a “focus” of the 
hyperbola. 



M 
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In fact, there are two foci and two directrices because the curve turns out to be symmetrical 
about a line parallel to / and the perpendicular line from S onto l. The diagram below 
illustrates two foci S and S' together with two directrices / and l'. The axes of symmetry are 
taken as the co-ordinate axes. 



It can be shown that, with this system of reference, the hyperbola has equation 



a 2 b 2 


1 , 


1 



with associated parametric equations 


x = asecO, y = b tan 6 

although, for students who meet “hyperbolic functions” , a better set of parametric equations 
would be 

x = acosht, y = bsinht. 

The curve clearly intersects the x-axis at (±o, 0) but does not intersect the y-axis at all. 

For the sake of completeness, it may further be shown that the eccentricity, e, is obtainable 
from the formula 

b 2 = a 2 ( e 2 - l) 

and, having done so, the foci lie at (±ae, 0) with directrices at x = ±“. However, in these 
units, students will not normally be expected to determine the eccentricity, foci or directrices 
of a hyperbola 

Note: 

A similar hyperbola to the one above, but intersecting the y-axis rather than the x-axis, has 
equation 


The roles of x and y are simply reversed. 



5.8.2 ASYMPTOTES 


A special property of the hyperbola is that, at infinity, it approaches two straight lines 
through the centre of the hyperbola called “asymptotes” . 


It can be shown that both of the hyperbolae 



a 2 b 2 


v 2 x 2 

1 and — 

b 2 a z 


1 


have asymptotes whose equations are: 
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0 . 


x _y 

a b 


0 and X - + V - = 
a b 


These are easily obtained by factorising the left hand side of the equation of the hyperbola, 
then equating each factor to zero. 

5.8.3 MORE GENERAL FORMS FOR THE EQUATION OF A HYPERBOLA 

The equation of a hyperbola, with centre (h, k) and axes of symmetry parallel to Oa; and O y 
respectively, is easily obtainable from one of the standard forms of equation by a temporary 
change of origin to the point ( h,k ). We obtain either 


(x — h ) 2 (y — k ) 2 

~0?‘ b 2 


with associated parametric equations 

x = h + asecO, y = k + b tan 6 


or 


(y — k) 2 (. x — h ) 2 

b 2 a 2 


with associated parametric equations 

x = h + a tan 9, y = k + bsecO. 


Hyperbolae will usually be encountered in the expanded form of the above cartesian equations 
and it will be necessary to complete the square in both the x terms and the y terms in order 
to locate the centre of the hyperbola. The expanded form will be similar in appearance to 
that of a circle but the coefficients of x 2 and y 2 will be different numerically and opposite in 
sign. 

EXAMPLE 

Determine the co-ordinates of the centre and the equations of the asymptotes of the hyperbola 
whose equation is 

Ax 2 — y 2 + 16a; + 6y + 6 = 0. 


Solution 


Completing the square in the x terms gives 


4ar + 16a; = 4 ar + 4a; 


= 4 


(x + 2) s 


4(x + 2) 2 - 16. 
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Completing the square in the y terms gives 


-y 2 + 6 y = - y - 6y 


(y — 3) 2 — 9 


Hence the equation of the hyperbola becomes 


4(* + 2) 2 - (y - 3) 2 = 1 


~(y — s) 2 + 9. 


or 


(x + 2) 2 (y - 3) 2 

/ i \ 2 12 


1 . 


The centre is thus located at the point (—2,3). 

The asymptotes are best obtained by factorising the left hand side of the penultimate version 
of the equation of the hyperbola, then equating each factor to zero. We obtain 

2(x + 2) - (y-3) = 0 and 2(x + 2) + (y - 3) = 0. 

In other words, 


2x — y + 7 = 0 and 2x + y + 1 = 0. 


To sketch the graph of a hyperbola, it is not always enough to have the position of the centre 
and the equations of the asymptotes. It may also be necessary to investigate some of the 
intersections of the curve with the co-ordinate axes. In the current example, by substituting 
first y — 0 and then x = 0 into the equation of the hyperbola, it is possible to determine 
intersections at (—0.84,0), (—7.16,0), (0, — 0.87) and (0,6.87). 

V \ ■' 

V -7 


-W^G- 


5.8.4 THE RECTANGULAR HYPERBOLA 

For some hyperbolae, it will turn out that the asymptotes are at right-angles to each other; 
in which case, the asymptotes themselves could be used as the x-axis and y-axis. When 
this choice of reference system is made for a hyperbola with centre at the origin, it can be 
shown that hyperbola has the simpler equation xy = C, where C is a constant. 
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Similarly, for a rectangular hyperbola with centre at the point (h, k) and asymptotes used 
as the axes of reference, the equation will be ( x — h)(y — k) = C. 



Note: 

A suitable pair of parametric equations for the rectangular hyperbola ( x — h ) (y — k) = C are 


x = t + h, 


, C 

y = k + T . 


EXAMPLES 

1. Determine the centre of the rectangular hyperbola whose equation is 

7x — 3y + xy — 31 = 0. 
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Solution 

The equation factorises into the form 

(x-3)(j/ + 7) = 10. 

Hence, the centre is located at the point (3, —7). 

2. A certain rectangular hyperbola has parametric equations 

1 

x — 1 + 1, y = 3 . 

’ y t 

Determine its points of intersection with the straight line x + y = 4. 

Solution 

Substituting for x and y into the equation of the straight line, we obtain 

1 9 

l+f+3— -=4 or A — 1 = 0. 

Hence, t — ±1 giving points of intersection at (2,2) and (0,4). 

5.8.5 EXERCISES 


1. For each of the following hyperbolae, determine the co-ordinates of the centre and the 
equations of the asympotes. Give a sketch of the curve, indicating where appropriate, 
the co-ordinates of its points of intersection with the a;- axis and y-axis: 


(a) 

(b) 


(d) 


x 2 - y 2 - 2y = 0; 
y 2 — x 2 — 6x = 10; 
x 2 — y 2 — 2x — 2y = 4; 
y 2 — x 2 — 6x + 4y = 14; 


9a: 2 — 4y 2 + 18a; — 16y = 43. 


2. Determine a pair of parametric equations for the rectangular hyperbola whose equation 
is xy — x + 2y — 6 = 0 and hence obtain its points of intersection with the straight line 
y = x + 3. Sketch the hyperbola and the straight line on the same diagram. 
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UNIT 5.9 - GEOMETRY 9 


CURVE SKETCHING IN GENERAL 
Introduction 

The content of the present section is concerned with those situations where it is desirable to 
find out the approximate shape of a curve whose equation is known, but not necessarily to 
determine an accurate “plot” of the curve. 

In becoming accustomed to the points discussed below, the student should not feel that 
every one has to be used for a particular curve; merely enough of them to give a satisfactory 
impression of what the curve looks like. 

5.9.1 SYMMETRY 

A curve is symmetrical about the x-axis if its equation contains only even powers of y. It is 
symmetrical about the y-axis if its equation contains only even powers of x. 

We say also that a curve is symmetrical with respect to the origin if its equation is unaltered 
when both x and y are changed in sign. In other words, if a point (x, y ) lies on the curve, 
so does the point (— x, —y). 

ILLUSTRATIONS 


1. The curve whose equation is 

x 2 [y 2 — 2) = x 4 + 4 

is symmetrical about to both the x-axis and the y axis. This means that, once the 
shape of the curve is known in the first quadrant, the rest of the curve is obtained from 
this part by reflecting it in both axes. 

The curve is also symmetrical with respect to the origin. 

2. The curve whose equation is 

xy = 5 

is symmetrical with respect to the origin but not about either of the co-ordinate axes. 


5.9.2 INTERSECTIONS WITH THE CO-ORDINATE AXES 

Any curve intersects the x-axis where y = 0 and the y-axis where x = 0; but sometimes the 
curve has no intersection with one or more of the co-ordinate axes. This will be borne out 
by an inability to solve for x when y = 0 or for y when x = 0 (or both). 
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ILLUSTRATION 


The circle 

meets the x-axis where 


x 2 + y 2 — Ax — 2y + 4 = 0 
x 2 — Ax + 4 = 0. 


That is, 


(* - 2) 2 = 

giving a double intersection at the point (2,0). 
x-axis at (2,0). 


0, 

This means that the circle touches the 


The circle meets the y-axis where 


V 2 ~ 2y + 4 = 0. 


That is, 


(y - 1) 2 = -3, 

which is impossible, since the left hand side is bound to be positive when y is a real number. 
Thus, there are no intersections with the y-axis. 


5.9.3 RESTRICTIONS ON THE RANGE OF EITHER VARIABLE 


It is sometimes possible to detect a range of x values or a range of y values outside of which 
the equation of a curve would be meaningless in terms of real geometrical points of the 
cartesian diagram. Usually, this involves ensuring that neither x nor y would have to assume 
complex number values; but other kinds of restriction can also occur. 

ILLUSTRATIONS 


1 . 


2. 


The curve whose equation is 

y 2 = 4x 

requires that x shall not be negative; that is, x > 0. 

The curve whose equation is 

y 2 = x (x 2 — l) 

requires that the right hand side shall not be negative; and from the methods of Unit 
1.10, this will be so when either x > 1 or —1 < x < 0. 
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5.9.4 THE FORM OF THE CURVE NEAR THE ORIGIN 


For small values of x (or y), the higher powers of the variable can often be usefully neglected 
to give a rough idea of the shape of the curve near to the origin. 

This method is normally applied to curves which pass through the origin, although the 
behaviour near to other points can be considered by using a temporary change of origin. 

ILLUSTRATION 


The curve whose equation is 
approximates to the straight line 

for very small values of x. 

5.9.5 ASYMPTOTES 


y = 3a; 3 — 2x 
y = — 2x 


DEFINITION 

An “asymptote” is a straight line which is approached by a curve at a very great distance 
from the origin. 

(i) Asymptotes Parallel to the Co-ordinate Axes 

Consider, by way of illustration, the curve whose equation is 

2 _ a; 3 (3 - 2 y) 


(a) By inspection, we see that the straight line x — 1 “meets” this curve at an infinite value 
of y, making it an asymptote parallel to the y- axis. 

(b) Now suppose we re-write the equation as 

3 _ y 2 ( x - x ) 

3 — 2a/ 

Inspection, this time, suggests that the straight line y = 3 “meets” the curve at an infinite 
value of x, making it an asymptote parallel to the x axis. 

(c) Another way of arriving at the conclusions in (a) and (b) is to write the equation of the 
curve in a form without fractions, namely 

y 2 (x-l)-x 3 (3-2y) = 0, 
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then equate to zero the coefficients of the highest powers of x and y. That is, 

The coefficient of y 2 gives x — 1 = 0. 

The coefficient of a: 3 gives 3 — 2y = 0. 

It can be shown that this method may be used with any curve to ford asymptotes parallel 
to the co-ordinate axes. Of course, there may not be any, in which case the method will not 
work. 

(ii) Asymptotes in General for a Polynomial Curve 

This paragraph requires a fairly advanced piece of algebraical argument, but an outline proof 
will be included, for the sake of completeness. 

Suppose a given curve has an equation of the form 

P(x,y) = 0 

where P(x,y) is a polynomial in x and y. 

Then, to find the intersections with this curve of a straight line 

y = mx + c, 

we substitute mx + c in place of y into the equation of the curve, obtaining a polynomial 
equation in x, say 

Of) T CliX T Ci2X~ T T <x n x — 0. 

In order for the line y = mx + c to be an asymptote, this polynomial equation must have 

coincident solutions at infinity. 

But now let us replace x by ^ giving, after multiplying throughout by u n , the new polynomial 
equation 

QqU T CL\U T Oj2 W T T Ci n —\U T CL n — 0 

This equation must have coincident solutions at u = 0 which will be the case provided 

a n = 0 and a n _ i = 0. 


Conclusion 

To find the asymptotes (if any) to a polynomial curve, we first substitute y = mx + c into 
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the equation of the curve. Then, in the polynomial equation obtained, we equate to zero 
the two leading coefRcents; (that is, the coefficients of the highest two powers of x ) and 
solve for m and c. 

EXAMPLE 

Determine the equations of the asymptotes to the hyperbola 

_ r; ! 

a 2 b 2 


Solution 


Substituting y = mx + c gives 


x 2 (mx + c) 2 
a 2 


1 . 


That is, 



2 mcx c 2 
~¥ ¥ 


Equating to zero the two leading coefficients; that is, the coefficients of x 2 and x, we obtain 


1 m 2 

a 2 b 2 


0 and 


2 me 

IF 


0 . 


No solution is obtainable if we let rn — 0 in the second of these statements since it would 
imply \ — 0 in the first statement, which is impossible. Therefore we must let c = 0 in the 
second statement, and m — ±- in the first statement. 

’ a 


The asymptotes are therefore 

in other words, 
as used earlier in the section on the hyperbola 


y = ±-x. 
a 


x y x y 

-~j = 0 or - + f = 0, 
a b a b 
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5.9.6 EXERCISES 


1. Sketch the graphs of the following equations: 

(a) 

1 

y = x+ 

x 


(b) 


y = 


l 

X 2 + 1 ’ 


(c) 


y 


2 


X 

x-2' 


(d) 


(x — l)(x + 4) 
(x — 2) (a; — 3) ’ 


(f) 


(g) 


y(x + 2) = (x + 3)(x ~ 4); 


x 2 ( y 2 - 25) = y: 


y = 6 — e 2x . 


2. For each of the following curves, determine the equations of the asymptotes which are 
parallel to either the x-axis or the y- axis: 


(a) 

(b) 


xy 2 + x 2 — 1 = 0; 


x 2 y 2 = 4 (x 2 + y 2 ) ; 
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y = 


x 2 — 3x + 5 
x — 3 


3. Determine all the asymptotes of the following curves: 


(a) 

(b) 


x 3 — xy 2 + 4x — 16 = 0; 


if' + 2 y 2 — x 2 y + y — x + 4 = 0. 


5.9.7 ANSWERS TO EXERCISES 

L(a) 
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UNIT 5.10 - GEOMETRY 10 


GRAPHICAL SOLUTIONS 

5.10.1 INTRODUCTION 

An algebraic equation in a variable quantity, re, may be written in the general form 

f(x) = 0 , 

where /(re) is an algebraic expression involving x ; we call it a “function of re” (see Unit 

10 . 1 ). 

In the work which follows, /(re) will usually be either a linear function of the form arc + b, 
where a and b are constants, or a quadratic function of the form arc 2 + bx + c where a, b 
and c are constants. 

The solutions of the equation /(re) = 0 consist of those values of re which, when substituted 
into the function /(re), cause it to take the value zero. 

The solutions may also be interpreted as the values of re for which the graph of the equation 

V = f(x) 

meets the re-axis since, at any point of this axis, y is equal to zero. 

5.10.2 THE GRAPHICAL SOLUTION OF LINEAR EQUATIONS 

To solve the equation 

arc + b = 0, 

we may plot the graph of the equation y = ax + b to find the point at which it meets the 
re- axis. 

EXAMPLES 

1. By plotting the graph of y = 3re — 5 from re = 0 to re = 3, solve the linear equation 

3re - 5 = 0. 


Solution 


re 

0 

1 

2 

3 

y 

-5 

-2 

1 

4 


1 






5.10.3 THE GRAPHICAL SOLUTION OF QUADRATIC EQUATIONS 

To solve the quadratic equation 

ax 2 + bx + c = 0 

by means of a graph, we may plot the graph of the equation y = ax 2 + bx + c and determine 
the points at which it crosses the x-axis. 

An alternative method is to plot the graphs of the two equations y = ax 2 + bx and y = —c 
in order to determine their points of intersection. This method is convenient since the first 
graph has the advantage of passing through the origin. 

EXAMPLE 

By plotting the graph of y = x 2 — Ax from x = — 2 to x = 6, solve the quadratic equations 
(a) 

x 2 — 4x = 0; 


(b) 


x 2 — Ax + 2 = 0; 


(c) 


x 2 — 4x — 1 = 0. 


Solution 

A table of values for the graph of y = x 2 — 4x is 


X 

-2 

-1 

0 

1 

2 

3 

4 

5 

6 

y 

12 

5 

0 

-3 

-2 

-3 

0 

5 

12 


For parts (b) and (c), we shall also need the graphs of y = — 2 and y — 1. 
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Hence, the three sets of solutions are: 

(a) 

x = 0 and x = 4; 


(b) 


x — 3.4 and x ~ 0.6; 


x — 4.2 and x — —0.2 


5.10.4 THE GRAPHICAL SOLUTION OF SIMULTANEOUS EQUATIONS 

A simple extension of the ideas covered in the previous paragraphs is to solve either a pair 
of simultaneous linear equations or a pair of simultaneous equations consisting of one linear 
and one quadratic equation. More complicated cases can also be dealt with by a graphical 
method but we shall limit the discussion to the simpler ones. 

EXAMPLES 


1. By plotting the graphs of 5x + y = 2 and —3 x + y = 6 from x = —2 to x — 2, determine 
the common solution of the two equations. 

Solution 


X 

-2 

-1 

0 

1 

2 

y i = 2 — 5a; 

12 

7 

2 

-3 

-8 

y 2 = 6 + 3a: 

0 

3 

6 

9 

12 


4 




Hence, x = —0.5 and y = 4.5. 

2. By plotting the graphs of the equations y = x 2 and y — 2 — 3x from x = 
determine their common solutions and hence solve the quadratic equation 

x 2 + 3a; — 2 = 0. 


Solution 


X 

-4 

-3 

-2 

-1 

0 

1 

2 

]-J\ = X 2 

16 

9 

4 

1 

0 

1 

4 

y 2 = 2 - 3x 

14 

11 

8 

5 

2 

-1 

-4 



Hence x ~ 0.6 and x — —3.6. 




5.10.5 EXERCISES 


In these exercises, state your answers correct to one place of decimals. 

1. Use a graphical method to solve the following linear equations: 
fa) 


(b) 


8a; — 3 = 0; 


8x = 7. 


2. Use a graphical method to solve the following quadratic equations: 

(a) 

2x 2 — x = 0; 

(b) 


2ar — x + 3 = 10; 


(c) 


2ar — x — 11. 

3. Use a graphical method to solve the foilwing pairs of simultaneous equations: 

(a) 

3a; — y = 6 and x + y = 0; 

(b) 

x + 2y = 13 and 2a; — 3 y — 14; 

(c) 

y = 3x 2 and y = —5a; + 1. 
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5.10.6 ANSWERS TO EXERCISES 


1. (a) 

(b) 

2. (a) 

(b) 

(c) 

3. (a) 

(b) 

(c) 


x ~ 0.4; 


x ~ 0.9 


x = 0 and x = 2; 


x ~ 2.1 and x ~ —1.6; 


x ~ 2.6 and x ~ —2.1 


x = 1.2 and y = —1.2; 


x ~ 9.6 and y ~ 1.7; 


x ~ 0.18 and 2 / — 0.1 or x ~ —1.8 and y ~ 10.2 
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UNIT 5.11 - GEOMETRY 11 - POLAR CURVES 


5.11.1 INTRODUCTION 

The concept of polar co-ordinates was introduced in Unit 5.1 as an alternative method, to 
cartesian co-ordinates, of specifying the position of a point in a plane. It was also seen that 
a relationship between cartesian co-ordinates, x and y , may be converted into an equivalent 
relationship between polar co-ordinates, r and 9 by means of the formulae, 

x — r cos 9, and y — r sin 9, 

while the reverse process may be carried out using the formulae 

r 2 = x 2 + y 2 and 6 = tan _1 (y/x). 

Sometimes the reverse process may be simplified by using a mixture of both sets of formulae. 

In this Unit, we shall consider the graphs of certain relationships between r and 9 without 
necessarily refering to the equivalent of those relationships in cartesian co-ordinates. The 
graphs obtained will be called “polar curves”. 

Note: 

In Unit 5.1, no consideration was given to the possibility of negative values of r; in fact, 
when polar co-ordinates are used in the subject of complex numbers (see Units 6.1 - 6.6) 
r is not allowed to take negative values. 

However, for the present context it will be necessary to assign a meaning to a point ( r,9 ), 
in polar co-ordinates, when r is negative. 

We simply plot the point at a distance of |r| along the 9 — 180° line; and, of course, this 
implies that, when r is negative, the point (r, 9) is the same as the point (|r|, 9 — 180°) 

5.11 2 THE USE OF POLAR GRAPH PAPER 

For equations in which r is expressed in terms of 9 , it is convenient to plot values of r against 
values of 9 using a special kind of graph paper divided into small cells by concentric circles 
and radial lines. 

The radial lines are usually spaced at intervals of 15° and the concentric circles allow a scale 
to be chosen by which to measure the distances, r, from the pole. 

We illustrate with examples: 
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EXAMPLES 


1. Sketch the graph of the equation 

r = 2 sin#. 

Solution 

First we construct a table of values of r and #, in steps of 15°, from 0° to 360°. 


# 

0° 

15° 

30° 

45° 

60° 

75° 

90° 

105° 

120° 

135° 

150° 

165° 

180° 

195° 

r 

0 

0.52 

1 

1.41 

1.73 

1.93 

2 

1.93 

1.73 

1.41 

1 

0.52 

0 

-0.52 


# 

210° 

225° 

240° 

255° 

270° 

285° 

300° 

315° 

330° 

345° 

360° 

r 

-1 

-1.41 

-1.73 

-1.93 

-2 

-1.93 

-1.73 

-1.41 

-1 

-0.52 

0 



Notes: 

(i) The curve, in this case, is a circle whose cartesian equation turns out to be 

x 2 + y 2 — 2y = 0. 

(ii) The fact that half of the values of r are negative means, here, that the circle is 
described twice over. For example, the point (—0.52, 195°) is the same as the point 
(0.52,15°). 


2 . 


Sketch the graph of the following equations: 
(a) 


(b) 


r = 2(1 + cos#); 
r — 1 + 2 cos#; 


r — 5 + 3 cos#. 
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Solution 

(a) The table of values is as follows: 


e 

0° 

15° 

30° 

45° 

60° 

75° 

90° 

105° 

120° 

135° 

150° 

165° 

180° 

r 

4 

3.93 

3.73 

3.42 

3 

2.52 

2 

1.48 

1 

0.59 

0.27 

0.07 

0 


e 

195° 

210° 

225° 

240° 

255° 

270° 

285° 

300° 

315° 

330° 

345° 

360° 

r 

0.07 

0.27 

0.59 

1 

1.48 

2 

2.52 

3 

3.42 

3.73 

3.93 

4 


90 

4 



(b) The table of values is as follows: 


e 

0° 

15° 

30° 

45° 

60° 

75° 

90° 

105° 

120° 

135° 

150° 

165° 

r 

3 

2.93 

2.73 

2.41 

2 

1.52 

1 

0.48 

0 

-0.41 

-0.73 

-0.93 


e 

195° 

210° 

225° 

240° 

255° 

270° 

285° 

300° 

315° 

330° 

345° 

3 

r 

-0.93 

-0.73 

-0.41 

0 

0.48 

1 

1.52 

2 

2.41 

2.73 

2.93 

3 




(c) The table of values is as follows: 


9 

0° 

15° 

30° 

45° 

60° 

75° 

90° 

105° 

120° 

135° 

150° 

165° 

180° 

r 

8 

7.90 

7.60 

7.12 

6.5 

5.78 

5 

4.22 

3.5 

2.88 

2.40 

2.10 

2 


e 

195° 

210° 

225° 

240° 

255° 

270° 

285° 

300° 

315° 

330° 

345° 

360° 

r 

2.10 

2.40 

2.88 

3.5 

4.22 

5 

5.78 

6.5 

7.12 

7.60 

7.90 

8 


90 

8 



Note: 

Each of the three curves in the above example is known as a “limacon” and they illustrate 
special cases of the more general curve, r = a + 6 cos#, as follows: 

(i) If a = b, the limacon may also be called a “cardioid”; that is, a heart-shape. At the 
pole, the curve possesses a “cusp”. 

(ii) If a < b, the limacon contains a “re-entrant loop”. 

(iii) If a > b, the limacon contains neither a cusp nor a re-entrant loop. 

Other well-known polar curves, together with any special titles associated with them, may 
be found in the answers to the exercises at the end of this unit. 
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5.11.3 EXERCISES 


Plot the graphs of the following polar equations: 


1 . 


2 . 


r — 3 cos 9. 


r = sin 39. 


3. 

4. 


r = sin 29. 


r = 4 cos 39. 


5. 

6 . 

7. 


r — 5 cos 29. 


r = 2sin 2 0. 


r = 2cos 2 9. 


r 2 = 25 cos 29. 


9. 

10 . 


r 2 = 16 sin 29. 


r = 29. 
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5.11.4 ANSWERS TO EXERCISES 


1. The graph is as follows: 



Note: 

This is an example of the more general curve, r — a cos 6, which is a circle. 
2. The graph is as follows: 


90 

1 



Note: 

This is an example of the more general curve, r = a sin nd, where n is odd. It is an 
“n-leaved rose”. 
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3. The graph is as follows: 


90 

1 



Note: 

This is an example of the more general curve, r = a sin n9, where n is even. It is a 
“2n-leaved rose”. 

4. The graph is as follows: 


90 

4 



Note: 

This is an example od the more general curve, r = a cos n6, where n is odd. It is an 

“n-leaved rose”. 
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5. The graph is as follows: 


90 



Note: 

This is an example of the more general curve, r = a cos n9, where n is even. It is a 
“2n-leaved rose”. 

6. The graph is as follows: 


90 

2 



Note: 

This is an example of the more general curve, r = asin 2 d, which is called a 
“lemniscate” . 



7. The graph is as follows: 


90 

2 



Note: 

This is an example of the more general curve, r = acos 2 d, which is also called a 
“lemniscate” . 

8. The graph is as follows: 


90 

5 



Note: 

This is an example of the more general curve, r 2 = a 2 cos 29. ft is another example of 
a “lemniscate”; but, since r 2 cannot be negative, there are no points on the curve in 
the intervals f < 9 < ^ and <9 < 7 -f. 
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9. The graph is as follows: 


90 

4 



Note: 

This is an example of the more general curve, r 2 = a 2 sin 26. It is another example of 
a “lemniscate” ; but, since r 2 cannot be negative, there are no points on the curve in 
the intervals | < 6 < n and ^ < 6 < 2ir. 

10. The graph is as follows: 



Note: 

This is an example of the more general curve, r = ad, a > 0, which is called an 

“Archimedean spiral” . 
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UNIT 6.1 - COMPLEX NUMBERS 1 - DEFINITIONS AND ALGEBRA 
6.1.1 THE DEFINITION OF A COMPLEX NUMBER 

Students who are already familiar with the Differential Calculus may appreciate that equa- 
tions of the form 


d2 y , h d y , t< ^ 

a d^ +h d X +cl, = f(x) - 

which are called “Differential Equations”, have wide-reaching applications in science and 
engineering. They are particularly applicable to problems involving either electrical circuits 
or mechanical vibrations. 

It is possible to show that, in order to determine a formula (without derivatives) giving the 
variable y in terms of the variable x, one method is to solve, first, the quadratic equation 
whose coefhcents are a, b and c and whose solutions are therefore 


—b ± y/b 2 — 4 ac 
2 a 


Note: 

Students who are not already familiar with the Differential Calculus should consider only 
the quadratic equation whose coefficients are a, b and c, ignoring references to differential 
equations. 

ILLUSTRATION 

One method of solving the differential equation 


d 2 _y 

dx 2 



+ 13 


2 sin x 


would be to solve, first, the quadratic equation whose coefficients are 1, —6 and 13. 

Its solutions are 

6 ± V36 - 52 _ 6± /T6 
2 ~~ 2 

which clearly do not exist since we cannot find the square root of a negative number in 
elementary arithmetic. 
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However, if we assume that the differential equation represents a genuine scientific problem 
with a genuine scientific solution, we cannot simply dismiss the result obtained from the 
quadratic formula. 

The difficulty seems to be, not so much with the —16 but with the minus sign in front of the 
16. We shall therefore write the solutions in the form 

6 ± 

2 

Notes: 

(i) The symbol \J — 1 will be regarded as an “imaginary” number. 

(ii) In mathematical work, y /— 1 is normally denoted by % but, in scientific work it is denoted 
by j in order to avoid confusion with other quantities (eg. electric current) which could be 
denoted by the same symbol. 

(iii) Whenever the imaginary quantity j = \f— T occurs in the solutions of a quadratic 
equation, those solutions will always be of the form a + bj (or a + jb), where a and b are 
ordinary numbers of elementary arithmetic. 

DEFINITIONS 

1. The term “complex number” is used to denote any expression of the form a + bj or 
a + jb where a and b are ordinary numbers of elementary arithmetic (including zero) 
and j denotes the imaginary number \/ — 1; i.e. j 2 = —1. 

2. If the value a happens to be zero, then the complex number a + bj or a + jb is called 
“purely imaginary” and is written bj or jb. 

3. If the value b happens to be zero, then the complex number a + bj or a + jb is defined 
to be the same as the number a and is called “real”. That is a + jO = a + Oj = a. 

4. For the complex number a + bj or a + jb, the value a is called the “real part” and the 
value b is called the “imaginary part”. Notice that the imaginary part is b and not 

jb- 

5. The complex numbers a ± bj are said to form a pair of “complex conjugates” and 
similarly a ± jb form a pair of complex conjugates. Alternatively, we may say, for 
instance, that a — jb is the complex conjugate of a + jb and a + jb is the complex 
conjugate of a — jb. 

Note: 

In some work on complex numbers, especially where many complex numbers may be under 
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discussion at the same time, it is convenient to denote real and imaginary parts by the 
symbols x and y respectively, rather than a and b. It is also convenient, on some occasions, 
to denote the whole complex number x + jy by the symbol z in which case the conjugate, 
x — jy, will be denoted by z. 

6.1.2 THE ALGEBRA OF COMPLEX NUMBERS 
INTRODUCTION 

An “Algebra” (coming from the Arabic word AL-JABR) refers to any mathematical system 
which uses the concepts of equality, addition, subtraction, multiplication and division. For 
example, the algebra of real numbers is what we normally call “arithmetic” ; but algebraical 
concepts can be applied to other mathematical systems of which the system of complex 
numbers is one. 

In meeting a new mathematical system for the first time, the concepts of equality, addition, 
subtraction, multiplication and division need to be properly defined, and that is the purpose 
of the present section. In some cases, the definitions are fairly obvious, but need to be made 
without contradicting ideas already established in the system of real numbers which complex 
numbers include. 

(a) EQUALITY 

Unlike a real number, a complex number does not have a “value” ; and so the word “equality” 
must take on a meaning, here, which is different from that used in elementary arithmetic. 
In fact two complex numbers are defined to be equal if they have the same real part and the 
same imaginary part. 

That is 


a + jb = c + jd if and only if a = c and b = d 


EXAMPLE 

Determine x and y such that 


(2x — 3 y) + j(x + by) = 11 — j 14. 
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Solution 


From the definition of equality, we may 

EQUATE REAL AND IMAGINARY PARTS. 

Thus, 

2a; — 3 y = 11, 
x + 5y = —14 

These simultaneous linear equations are satisfied by x = 1 and y = —3. 

(b) ADDITION AND SUBTRACTION 

These two concepts are very easily defined. We simply add (or subtract) the real parts and 
the imaginary parts of the two complex numbers whose sum (or difference) is required. 

That is, 

(a + jb ) + (c + jd) = (a + c) + j(b + d) 

and 

(a + jb) — (c + jd) = (a — c) + j{b — d). 

EXAMPLE 

(-7 + j2) + (10 - j5) = 3 - j3 = 3(1 - j) 

and 

(—7 + j2) — (10 — j5) = —17 + j7. 
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(c) MULTIPLICATION 

The definition of multiplication essentially treats j in the same way as any other algebraic 
symbol, but uses the fact that j 2 = —1. 

Thus, 


(a + jb)(c + jd) = (ac — bd ) + j{bc + ad ); 

but this is not so much a formula to be learned off-by-heart as a technique to be applied in 
future examples. 


EXAMPLES 


1 . 


2 . 

3. 


(5+j9)(2 + j6) = (10-54) +j (18 + 30) = -44 + j48. 
(3 - j8)(l + j4) = (3 + 32) + j(- 8 + 12) = 35 + j4. 
(a + jb)(a — jb ) = a 2 + b 2 . 


Note: 

The third example above will be useful in the next section. It shows that the product of 
a complex number and its complex conjugate is always a real number consisting 
of the sum of the squares of the real and imaginary parts. 

(d) DIVISION 

The objective here is to make a definition which provides the real and imaginary parts of 
the complex expression 


a + jb 
c + jd 

Once again, we make this definition in accordance with what would be obtained algebraically 
by treating j in the same way as any other algebraic symbol, but using the fact that j 2 = — 1. 
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The method is to multiply both the numerator and the denominator of the complex ratio 
by the conjugate of the denominator giving 

a + jb c — jd ( ac + bd) + j(bc — ad) 

c + jd c — jd c 2 + d? 

The required definition is thus 

a + jb (ac + bd) + j (be — ad) 
c + jd c 2 + d? 

which, again, is not so much a formula to be learned off-by-heart as a technique to be applied 
in future examples. 

EXAMPLES 

1. 

5 + j3 _ 5 + j3 2 - j7 

2-./T 2 - ./7 2 j~ 

_ (10 + 21) +j( 6 - 35) _ 31 - j29 
~ 2 2 + 7 2 ““ 53 ' 

Hence, the real part is || and the imaginary part is — 1|. 

2. 

6 + j 6 + j —j2 — 4 
j2 — 4 “ j2 — 4’ —j2 — 4 

(-24 + 2) + j(—4 - 12) _ -22 - j 16 
(-2) 2 + (-4) 2 “ 20 ' 

Hence, the real part is — ^ and the imaginary part is — 
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6.1.3 EXERCISES 


1. Simplify the following: 

(a) j 3 ; (b) j 4 ; (c) j 5 ; (d) j 15 ; (e) j 22 . 

2. If ^! = 2 — j 5, £2 = 1 + j'7 and Z 3 = —3 — j 4, determine the following in the form a + jb: 

(a) 

£1 — ^2 + £3; 

(b) 

2 ^i + Z2 — Z3; 

(c) 

Zi - ( 4 z 2 - 23); 

(d) 

21. 

2 2 ’ 

(e) 

22 . 

23’ 

(f) 

^3 

Zl' 

3. Determine the values of x and y such that 

(3x — 5 y) + j(x + 3 y) = 20 + j2. 


4. Determine the real and imaginary parts of the expression 

(1-j3) 2 +j(2 + j 5)- 3(4 ~ j) . 

1 - J 

5. If z = x + jy and z = x — jy are conjugate complex numbers, determine the values of 
x and y such that 


4zz-3(z-z) =2 +j. 
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6.1.4 ANSWERS TO EXERCISES 


1. (a) -j; (b) 1; (c) j; (d) -j; (e) -1. 

2. (a) 

4 — J 16; 

(b) 


(c) 

(d) 


8 + j; 


-5-j37; 


-0.66 - j0.38; 


(e) 

(f) 


-l.24-j0.68; 


0.48- j0.79 


3. 


x = 5 and y — — 1 


4. The real part = —20.5; the imaginary part = —8.5 

5. 
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UNIT 6.2 - COMPLEX NUMBERS 2 


THE ARGAND DIAGRAM 
6.2.1 INTRODUCTION 

It may be observed that a complex number x + jy is completely specified if we know the 
values of x and y in the correct order. But the same is true for the cartesian co-ordinates, 
(x,y), of a point in two dimesions. There is therefore a “one-to-one correspondence” 
between the complex number x + jy and the point with co-ordinates ( x,y ). 

Hence it is possible to represent the complex number x-\-jy by the point (x, y) in a geometrical 
diagram called the Argand Diagram: 


V 


{x,y) 


O 


DEFINITIONS: 


1. The a;-axis is called the “real axis” since the points on it represent real numbers. 

2. The y-axis is called the “imaginary axis” since the points on it represent purely 
imaginary numbers. 

6.2.2 GRAPHICAL ADDITON AND SUBTRACTION 

If two complex numbers, z\ — x\ + jy i and Z2 = X2 + jy2, are represented in the Argand 
Diagram by the points Pi(xi,?/i) and P2<A2,2/2) respectively, then the sum, z± + Z2, of the 
complex numbers will be represented by the point Q(x± + X2, yi +2/2)- 

If O is the origin, it is possible to show that Q is the fourth vertex of the parallelogram 
having OPi and OP 2 as adjacent sides. 
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In the diagram, the triangle ORPi has exactly the same shape as the triangle P2SQ. Hence, 
the co-ordinates of Q must be (xi + x 2 , lj\ + y 2 ). 

Note: 

The difference z\ — z 2 of the two complex numbers may similarly be found by completing the 
parallelogram of which two adjacent sides are the straight line segments joining the origin 
to the points with co-ordinates (xi,yi) and (—x 2 , — y 2 ). 

6.2.3 MULTIPLICATION BY j OF A COMPLEX NUMBER 

Given any complex number z = x + jy, we observe that 


jz =j(x + jy ) = -y + jx. 


Thus, if z is represented in the Argand Diagram by the point with co-ordinates A(x, y ), then 
jz is represented by the point with co-ordinates B(—y,x). 
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But OB is in the position which would be occupied by OA if it were rotated through 90° in 
a counter-clockwise direction. 

We conclude that, in the Argand Diagram, multiplication by j of a complex number rotates, 
through 90° in a counter-clockwise direction, the straight line segment joining the origin to 
the point representing the complex number. 

6.2.4 MODULUS AND ARGUMENT 



(a) Modulus 

If a complex number, z = x + jy is represented in the Argand Diagram by the point, P, 
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with cartesian co-ordinates (x,y) then the distance, r, of P from the origin is called the 
“modulus” of z and is denoted by either \z\ or \x + jy\. 

Using the theorem of Pythagoras in the diagram, we conclude that 


r = z 


\x + jy | = \/x 2 + 


Note: 

This definition of modulus is consistent with the definition of modulus for real numbers 
(which are included in the system of complex numbers). For any real number x, we may say 
that 


x\ — \x + j0| = \/ x 2 + 0 2 = Vx 2 , 


giving the usual numerical value of x. 

ILLUSTRATIONS 

1 . 

|3 - j4| = ^3 2 + (-4) 2 = v^25 = 5. 


2 . 

3. 


\i+j\ = v / h + h = V 2 . 

\j7\ = |0 + j7\ = Vo 2 + 72 = V4Q = 7. 


Note: 

The result of the last example above is obvious from the Argand Diagram since the point on 
the y-axis representing j 7 is a distance of exactly 7 units from the origin. In the same way, 
a real number is represented by a point on the x-axis whose distance from the origin is the 
numerical value of the real number. 

(b) Argument 

The “argument” (or “amplitude”) of a complex number, z, is defined to be the angle, 9, 
which the straight line segment OP makes with the positive real axis (measuring 9 positively 
from this axis in a counter-clockwise sense). 
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In the diagram, 


tan 9 = — ; that is, 9 = tan l — . 
x x 

Note: 

For a given complex number, there will be infinitely many possible values of the argument, 
any two of which will differ by a whole multiple of 360°. The complete set of possible values 
is denoted by Arg^, using an upper-case A. 

The particular value of the argument which lies in the interval —180° < 9 < 180° is called 
the “principal value” of the argument and is denoted by arg z using a lower-case a. The 
particular value, 180°, in preference to —180°, represents the principal value of the argument 
of a negative real number. 

ILLUSTRATIONS 

1. 

1 

71 

where k may be any integer. But we note that 

ar g(v / 3 + j) = 30° only. 

2 . 

Arg(— 1 + j) = tan _1 (— 1) = 135° + A;360° 

but not —45° + /c360°, since the complex number — 1 + j is represented by a point in 
the second quadrant of the Argand Diagram. 

We note also that 

arg(— 1 + j) = 135° only. 

3. 

Arg(— 1 - j ) = tan _1 (l) = 225° + A;360° or - 135° + A;360° 

but not 45° + k360° since the complex number — 1 — j is represented by a point in the 
third quadrant of the Argand Diagram. 

We note also that 

arg(— 1 — j) = —135° only. 


j = 30° + A;360°, 


Arg(v / 3 + j) = tan 1 ^ 


5 



Note: 

It is worth mentioning here that, in the Argand Diagram, the directed straight line segment 
described from the point Pi (representing the complex number z\ — X\ + jy\) to the point 
P 2 (representing the complex number z 2 = x 2 + jy 2 ) has length, r, equal to \z 2 — z\ |, and is 
inclined to the positive direction of the real axis at an angle, 9, equal to arg(z 2 — Zi). This 
follows from the relationship 


z 2 - Zi = (x 2 - aq) + j(y 2 - y{) 

in which x 2 — X\ and y 2 — y\ are the distances separating the two points, parallel to the real 
axis and the imaginary axis respectively. 



6.2.5 EXERCISES 

1. Determine the modulus (in decimals, where appropriate, correct to three significant 
figures) and the principal value of the argument (in degrees, correct to the nearest 
degree) of the following complex numbers: 

(a) 

i — j; 


—3 + j4; 


\/2-jV2; 


(b) 


6 



(d) 

1 _ .y/3' 

O C\ 1 


-7-j9. 

2. If z — 4 — j 5, verify that jz has the same modulus as z but that the principal value of 
the argument of jz is greater, by 90° than the principal value of the argument of z. 

3. Illustrate the following statements in the Argand Diagram: 

(a) 

(6 — jll) + (5 + j3) = 11 — j8; 

(b) 

(6-jll)-(5 + j3) = — 1— jU. 

6.2.6 ANSWERS TO EXERCISES 

1. (a) 1.41 and -45°; 

(b) 5 and 127°; 

(c) 2 and -135°; 

(d) 1 and —60°; 

(e) 11.4 and -128°. 

2. 4 — j-5 has modulus v^4l and argument —51°; 

j( 4 — j 5) = 5 + j4 has modulus \/41 and argument 39° = —51° + 90°. 

3. Construct the graphical sum and difference of the two complex numbers. 
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UNIT 6.3 - COMPLEX NUMBERS 3 
THE POLAR AND EXPONENTIAL FORMS 
6.3.1 THE POLAR FORM 



From the above diagram, we may observe that 

— = cos 9 and - = sin 9. 

Hence, the relationship between x, y, r and 9 may also be stated in the form 

x — r cos 9 , y — r sin 9 , 

which means that the complex number x + jy may be written as r cos 9 + jr sin 6. In other 
words, 

x + jy = r(cos 9 + j sin 9). 

The left-hand-side of this relationship is called the “rectangular form” or “cartesian 
form” of the complex number while the right- hand-side is called the “polar form”. 


1 



Note: 

For convenience, the polar form may be abbreviated to rZ$, where 6 may be positive, negative 
or zero and may be expressed in either degrees or radians. 

EXAMPLES 

1. Express the complex number z = \/3 + j in polar form. 

Solution 


z\ — r — v / 3 _ +T = 2 


and 

Avgz = 9 = tan -1 = 30° + A;360°, 

where k may be any integer. 

Alternatively, using radians, 

7T 

Argz = — h k2TT, 

6 

where k may be any integer. 

Hence, in polar form, 

z = 2(cos[30° + A;360°] + j sin[30° + £360°]) = 2Z[30° + A;360°] 


( cos 

_7r 

— + k2n 

+ j sin 

_7r 

— + k2n 

) = 2Z 

_7r 

— + k27T 

V 

16 \ 


16 \ 

J 

l_6 J 


2. Express the complex number z = —1 — j in polar form. 

Solution 


z\ — r — vT+T = \pl 


and 


Arg^ = 6 = tan _1 (l) = -135° + A;360°, 
where k may be any integer. 
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Alternatively, 


Axgz 


37T 

T 


+ k2n, 


where k may be any integer. 

Hence, in polar form, 

z = \/2(cos[— 135° + A;360°] + jsin[-135° + £360°]) = \/2Z[-135 0 + A;360°] 
or 


( cos 

3vr 

— — + k2ix 

+ j sin 

3vr 

— — + k2n 

)=V2L 

3tt , 
— — + k2ix 

V 

V 4 J 


l 4 J 

J 

[ 4 J 


Note: 

If it is required that the polar form should contain only the principal value of the argument, 
6 , then, provided —180° < 6 < 180° or — n < 6 < n, the component /c360° or k2n of the 
result is simply omitted. 

6.3.2 THE EXPONENTIAL FORM 

Using some theory from the differential calculus of complex variables (not included here) it 
is possible to show that, for any complex number, z, 


and 


, z z 2 z 3 z 4 

' =1 + H + 2! + 3! + 4! + " 

~3 ^5 ~7 

sin Z = Z - — + — - — + 


z 4 z 6 

C0SS = 1 -2! + n-6! + 


These are, in fact, taken as the definitions of the functions e z ,sinz and cos^. 

Students who are already familiar with the differential calculus of a real variable, x, may 
recognise similarities between the above formulae and the “MacLaurin Series” for the func- 
tions e x ,sina; and cosx. In the case of the series for sin x and cos a:, the value, x, must be 
expressed in radians and not degrees. 

A useful deduction can be made from the three formulae if we make the substitution z = jO 
into the first one, obtaining: 


3 



j* = 1 + iA+W.W.W. 

+ 1! + 2! + 3! + 4! 

and, since j 2 = —1, this gives 


^=i +4 


d 2 .d 3 e 4 

7 1 h 

2! J 3! 4! 


On regrouping this into real and imaginary parts, then using the sine and cosine series, we 
obtain 


V e = cos 9 + j sin 9, 

provided 9 is expressed in radians and not degrees. 

The complex number x+jy, having modulus r and argument # + /c27t, may thus be expressed 
not only in polar form but also in 


the exponential form, re^ e . 


ILLUSTRATIONS 

Using the examples of the previous section 

1. 

VS + j = 2e^ +fc2 U>. 

2 . 

-1 +j = 

3. 

-1 -j = V2e~ j (^ +k2w ) . 

Note: 

If it is required that the exponential form should contain only the principal value of the 
argument, 9, then, provided — 7r < 9 < tt, the component k2n of the result is simply omitted. 


4 



6.3.3 PRODUCTS AND QUOTIENTS IN POLAR FORM 


Let us suppose that two complex numbers z\ and z 2 have already been expressed in polar 
form, so that 


zi = ri(cos6h + jsindi) = ri/6i 


and 


z 2 = r 2 (cos 9 2 + j sin 0 2 ) = r 2 /_0 2 . 


It is then possible to establish very simple rules for determining both the product and the 
quotient of the two complex numbers. The explanation is as follows: 

(a) The Product 


Z\.z 2 = r 1 .r 2 (cos6 l i + j sind 1 ).(cos0 2 + jsin# 2 ). 


That is, 


z\.z 2 = ri.r 2 ([cos#i. cos^ — sin 6\. sind 2 ] + j[sin 6\. cos 0- 2 + cos 6\. sind 2 ]). 

Using trigonometric identities, this reduces to 

zi.z 2 = ri.r 2 (cos[0i + 0 2 \ + j sin[6>i + d 2 }) = rvr 2 Z[0i + d 2 \. 

We have shown that, to determine the product of two complex numbers in polar form, we 
construct the product of their modulus values and the sum of their argument values. 

(b) The Quotient 


Z\ rq (cos 9 1 + j sin 9i) 
z 2 r 2 (cos 0 2 + j sin 0 2 ) ' 

On multiplying the numerator and denominator by cos0 2 — j sin 9 2j we obtain 


5 



— = — ( [cos 9i . cos 62 + sin Oi . sin 0 2 ] + j [sin 9 X . cos d 2 — cos 9 \ . sin 9 2 ] ) . 
^2 f 2 

Using trigonometric identities, this reduces to 

— = — (cos[0i - 9 2 } + jsin[9 1 - 9 2 ]) = — L\9 X - 9 2 \. 
z 2 r 2 r 2 


We have shown that, to determine the quotient of two complex numbers in polar form, we 
construct the quotient of their modulus values and the difference of their argument values. 

ILLUSTRATIONS 

Using results from earlier examples: 

1 . 


(V3 + j).(-l - j) = 2Z30°.v / 2Z(— 135°) = 2 V / 2Z(-105°). 


We notice that, for all of the complex numbers in this example, including the result, 
the argument appears as the principal value. 


\/3+J 

-1 -3 


2Z30° 

v / 2Z(— 135°) 


\/2Z165 0 . 


Again, for all of the complex numbers in this example, including the result, the argument 
appears as the principal value. 

Note: 

It will not always turn out that the argument of a product or quotient of two complex 
numbers appears as the principal value. For instance, 

3. 


(-1 - j).{-V 3 - j) = v / 2Z(— 135°).2Z(— 150°) = 2 v / 2Z(-285°), 
which must be converted to 2v^2Z(75°) if the principal value of the argument is required. 


6 



6.3.4 EXERCISES 


In the following cases, express the complex numbers Z\ and z 2 in 

(a) the polar form, r/6 
and 

(b) the exponential form, re 3 > 
using only the principal value of 6. 

(c) For each case, determine also the product, Z 1 .Z 2 , and the quotient, in polar form using 
only the principal value of the argument. 

1 . 

zi = 1 + j, z 2 = ^3- j. 

2 . 


3. 


Zi = -V2 + jy/2, z 2 = —3 — jA. 
Zi = —4 — j5, z 2 = 7-j9. 


6.3.5 ANSWERS TO EXERCISES 


1. (a) 

(b) 


z\ — \/2Z45 0 z 2 = 2Z(— 30°); 
z\ = V2e 3 * z 2 = 2e~ 3 s] 

z\.z 2 — 2\/2Z15 0 — = — Z75°. 

£2 2 


2. (a) 


z\ = 2Z(135°) = 5Z(-127°); 


7 



(b) 


Zl = 2e^ z 2 = 5e-> 2 - 22 ; 


(c) 


3. (a) 

(b) 

(c) 


*1.32 = 10Z8° — = ^Z(— 98°). 
z 2 5 

Z! = 6. 40Z(— 128.66°) z 2 = 11.40Z(-55.13°) 
Z! = 6.40e~ j2 - 25 z 2 = 11.40e- jO - 96 ; 


Z!.z 2 = 72.96Z176.21° — = 0.56Z(-73.53°) 

z 2 
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UNIT 6.4 - COMPLEX NUMBERS 4 


POWERS OF COMPLEX NUMBERS 

6.4.1 POSITIVE WHOLE NUMBER POWERS 

As an application of the rule for multiplying together complex numbers in polar form, it is 
a simple matter to multiply a complex number by itself any desired number of times. 

Suppose that 

z = rLO. 


Then, 


z 2 = r.rL{9 + 9) = r 2 Z2$; 

z 3 = z.z 2 = r.r 2 l(9 + 2 9) = r 3 1.39] 
and, by continuing this process, 


z n = r n Ln9. 

This result is due to De Moivre, but other aspects of it will need to be discussed before we 
may formalise what is called “De Moivre’s Theorem”. 

EXAMPLE 


( 1 . 1 \ 19 

"7T" 


'197 r' 


'37 r' 

{V2 +3 72) 

A. 

) 19 = 1Z 

4 

= 1Z 

. 4 . 


1 . 1 
V2 +3 V2' 


6.4.2 NEGATIVE WHOLE NUMBER POWERS 

If n is a negative whole number, we shall suppose that 

n = — m, 


where m is a positive whole number. 
Thus, if z — r/9, 


1 



1 


1 

r m /_m9 


z — z ' = — = 


In more detail. 


z 


n 


1 

r m ( cos m6 + j sin m6) ’ 


giving 


z 


n 


l 

rpUl 


(cos rnO — j sin m6) 
cos 2 m6 + sin 2 m9 


= r m (cos[ — mO] + j sin[— m6}). 


But — m = n, and so 


z n = r n (cosn9 + j sin n6) = r n fnO , 


showing that the result of the previous section remains true for negative whole number 
powers. 

EXAMPLE 


(V3 + j)~ 3 = (2Z30 0 )- 3 = gZ(-90°) = - J -. 

6.4.3 FRACTIONAL POWERS AND DE MOIVRE’S THEOREM 

To begin with, here, we consider the complex number 

1 

z n , 

where n is a positive whole number and z = rZ0. 

We define z» to be any complex number which gives z itself when raised to the power n. 
Such a complex number is called “an n-th root of z n . 
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Certainly one such possibility is 


i 9 

r " <£ — , 

n 


by virtue of the paragraph dealing with positive whole number powers. 


But the general expression for z is given by 


z = r£{6 + k360°), 


where k may be any integer; and this suggests other possibilities for z», namely 


i 9 + k 360° 

r » L . 

n 


However, this set of n-th roots is not an infinite set because the roots which are given by 

k — 0,1, 2, 3 n — 1 are also given by k — n, n + 1, n + 2, n + 3, , 2 n — 1, 2 n, 

2 n + 1, 2 n + 2, 2 n + 3, and so on, respectively. 


We conclude that there are precisely n n-th roots given by k = 0, 1, 2, 3 , n — 1. 


EXAMPLE 


Determine the cube roots (i.e. 3rd roots) of the complex number j 8. 


Solution 


We first write 

j8 = 8Z(90° + A;360°). 


Hence, 


. . . i 0 i (90° + /c360°) 
(j8)3 =8W^ 


where k — 0, 1, 2 


3 



The three distinct cube roots are therefore 

2Z30°,2Z150° and 2Z270 0 = 2Z(-90°). 

They all have the same modulus of 2 but their arguments are spaced around the Argand 
Diagram at regular intervals of yy = 120°. 

Notes: 

(i) In general, the n-th roots of a complex number will all have the same modulus, but their 
arguments will be spaced at regular intervals of yy. 

(ii) Assuming that —180° < 6 < 180°; that is, assuming that the polar form of z uses the 
principal value of the argument, then the particular n-th root of z which is given by k = 0 

is called the “principal n-th root”. 

(iii) If — is a fraction in its lowest terms, we define 

m 

Z n 

/ l \ m / \ 1 

to be either ( ) or (z m ) n both of which turn out to give the same set of n distinct results. 
The discussion, so far, on powers of complex numbers leads us to the following statement: 

DE MOIVRE’S THEOREM 

If z = rZ0, then, for any rational number n, one value of z 11 is r n /n6. 

6.4.4 EXERCISES 


1. Determine the following in the form a + jb , expressing a and b in decimals correct to 
four significant figures: 

(a) 

(1 + j'a/3) 10 ; 

(b) 

(2-J5)- 4 . 

2. Determine the fourth roots of j’81 in exponential form re 3> where r > 0 and —7 r < 9 < n. 

3. Determine the fifth roots of the complex number — 4 + j4 in the form a + j b expressing 
a and b in decimals, where appropriate, correct to two places. State also which root is 
the principal root. 
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4. Determine all the values of 

(3 + j4)i 

in polar form. 


6.4.5 ANSWERS TO EXERCISES 


1. (a) 


(1 + jV3) 10 = -512.0 - j 886.8; 


(b) 


2. The fourth roots are 


3. The fifth roots are 


(2 - j5)” 4 = 5.796 - jl. 188 


3e * . 


3tt 

3e * . 


_ 7tt 

3e * . 


3e _ "8" 


1.26 + j0.64, — 0.22 + j 1.40, - 1.40 +j0.22, - 0.64-jl.26, 1 - j. 


The principal root is 1.26 + j0.64. 

4. There are two values, namely 


11.18779.695° and 11.18Z(-100.305°). 
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UNIT 6.5 - COMPLEX NUMBERS 5 


APPLICATIONS TO TRIGONOMETRIC IDENTITIES 
6.5.1 INTRODUCTION 

It will be useful for the purposes of this section to restate the result known as “Pascal’s 
Triangle” previously discussed in Unit 2.2. 

If n is a positive whole number, the diagram 


1 1 

12 1 
13 3 1 

1 4 6 4 1 

provides the coefficients in the expansion of (A + B) n which contains the sequence of terms 

A n , A n ~ 1 B, A n ~ 2 B 2 , A n ~ 3 B 3 , ,B'\ 

6.5.2 EXPRESSIONS FOR cos nd AND sin n6 IN TERMS OF cos# AND sin#. 
From De Moivre’s Theorem 


(cos # + j sin 9) n = cos nO + j sin n6, 


from which we may deduce that, in the expansion of the left-hand-side, using Pascal’s Trian- 
gle, the real part will coincide with cos n6 and the imaginary part will coincide with sin n6. 

EXAMPLE 


(cos# + j sin#) 3 = cos 3 # + 3cos 2 #.(j sin#) + 3 cos #.(j sin#) 2 + (j sin#) 3 . 


That is, 


cos 3# = cos 3 # — 3 cos #.sin 2 # or 4cos 3 # — 3 cos #, 


1 



using sin 2 # = 1 — cos 2 #; 
and 


sin 3 # = 3cos 2 #. sin 6 — sin 3 # or 3 sin # — 4sin 3 #, 


using cos 2 # = 1 — sin 2 #. 

6.5.3 EXPRESSIONS FOR cos”# AND sin”# IN TERMS OF SINES AND COSINES 
OF WHOLE MULTIPLES OF #. 

The technique described here is particularly useful in calculus problems when we are required 
to integrate an integer power of a sine function or a cosine function. It does stand, however, 
as a self-contained application to trigonometry of complex numbers. 

Suppose 


z = cos # + j sin # — (1) 

Then, by De Moivre’s Theorem, or by direct manipulation, 

1 

— = cos # — j sin # — (2). 

Adding (1) and (2) together, then subtracting (2) from (1), we obtain 



z — - = j 2 sin # 


Also, by De Moivre’s Theorem, 


and 


z n = cos rid + j sin n6 — (3) 


— = cos nQ — j sin n6 — (4). 


2 




Adding (3) and (4) together, then subtracting (4) from (3), we obtain 


z n + A = 2 cos n# 

z n 


z n — A = j 2 sin nd 


We are now in a position to discuss some examples on finding trigonometric identities for 
whole number powers of sin # or cos #. 

EXAMPLES 


1. Determine an identity for sin 3 #. 

Solution 

We use the result 


j 3 2 3 sin 3 # 



3 


where z = cos # + j sin 9. 
That is, 


— j8sin 3 # = - 3z 2 . 


1 

z 



or, after cancelling common factors, 


— j8sin 3 # = z 3 - 3z + 


3 1 

z z 3 



1 

z 3 




5 


which gives 


Hence, 


— j8sin 3 # = j 2 sin 39 — j 6 sin 9. 


sin 3 # = - (3 sin # — sin 3#) . 


3 




2. Determine an identity for cos 4 #. 
Solution 
We use the result 


2 4 cos 4 # = 




4 

1 


where z = cos # + j sin 6. 
That is, 


16cos 4 # = z 4 + 4^ 3 .- + 6z 2 . + 4*.(±) + 


or, after cancelling common factors, 


16cos 4 d = z 4 + Az 2 + 6 + 


+ -r = - 4 + 


+ 4 U + 


i) + 


which gives 


16cos 4 d = 2 cos 49 + 8 cos 29 + 6. 


Hence, 

cos 4 d = - (cos 49 + 4 cos 29 + 3) 
8 


6.5.4 EXERCISES 

1. Use a complex number method to determine identities for cos4d and sin 49 in terms of 
sin 9 and cosd. 

2. Use a complex number method to determine an identity for sin 5 # in terms of sines of 
whole multiples of 9. 

3. Use a complex number method to deternine an identity for cos 6 # in terms of cosines of 
whole multiples of #. 


4 



6.5.5 ANSWERS TO EXERCISES 


1 . 


cos 4# = cos 4 # — 6cos 2 #.sin 2 # 


and 


2 . 


3. 


sin 4# = 4cos 3 #.sin# — 4 cos#. sin 3 #. 


sin 5 # 


— (sin 5# — 5 sin 3# + 10 sin #] 
16 


cos 6 # 


1 

32 


(cos 6# + 6 cos 4# + 15 cos 2# + 10). 
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UNIT 6.6 - COMPLEX NUMBERS 6 


COMPLEX LOCI 
6.6.1 INTRODUCTION 

In Unit 6.2, it was mentioned that the directed line segment joining the point representing 
a complex number z \ to the point representing a complex number z^ is of length equal 
to \z 2 — z\ | and is inclined to the positive direction of the real axis at an angle equal to 

arg(^ 2 - zi). 


This observation now has significance when discussing variable complex numbers which are 
constrained to move along a certain path (or “locus”) in the Argand Diagram. For many 
practical applications, such paths (or “loci”) will normally be either straight lines or circles 
and two standard types of example appear in what follows. 

In both types, we shall assume that z — x + jy denotes a variable complex number (rep- 
resented by the point (x,y) in the Argand Diagram), while z 0 = x 0 + jy 0 denotes a fixed 
complex number (represented by the point (x 0 , yo) hi the Argand Diagram). 

6.6.2 THE CIRCLE 

Suppose that the moving point representing z moves on a circle, with radius a, whose centre 
is at the fixed point representing z 0 . 


y 



x 


1 



Then the distance between these two points will always be equal to a. In other words, 


z — Zq\ = a 


and this is the standard equation of the circle in terms of complex numbers. 

Note: 

By substituting z = x + jy and zo = xo + jyo in the above equation, we may obtain the 
equivalent equation in terms of cartesian co-ordinates, namely, 


| (a: - x 0 ) +j(y - y 0 )\ = a. 


That is, 


(x - x 0 ) 2 + (y- yof = a 2 . 


ILLUSTRATION 

The equation 


\z — 3 + j’4| = 7 

represents a circle, with radius 7, whose centre is the point representing the complex number 
3 — j4. 

In cartesian co-ordinates, it is the circle with equation 

(x - 3) 2 + (y + 4) 2 =49. 

6.6.3 THE HALF-STRAIGHT-LINE 

Suppose now that the “directed” straight line segment described from the fixed point 
representing Zo to the moving point representing z is inclined at an angle 9 to the positive 
direction of the real axis. 


2 



Then, 


arg (z - zo) = 0 

and this equation is satisfied by all of the values of z for which the inclination of the directed 
line segment is genuinely 9 and not 180° — 6. The latter angle would correspond to points 
on the other half of the straight line joining the two points. 



Note: 

If we substitute z = x + jy and zq = xq + jyo, we obtain 

arg ([a: - x 0 ] + j[y ~ Vo]) = 0- 


That is, 


or 


tan 


i V - Vo 


X — Xo 


9 


y-y o = tan 9(x - x 0 ), 

which is certainly the equation of a straight line with gradient tan 9 passing through the 
point (To, yo)', but it represents only that half of the straight line for which x — x 0 and y — y 0 
correspond, in sign as well as value, to the real and imaginary parts of a complex number 
whose argument is genuinely 9 and not 180° — 9. 


3 



ILLUSTRATION 


The equation 


7T 

arg(z + 1 — j5) - - 

o 

represents the half-straight-line described from the point representing Zo = — 1 + j5 to the 
point representing z = x + jy and inclined to the positive direction of the real axis at an 
angle of — 



In terms of cartesian co-ordinates, 

7T 

arg([x + 1] + j[y— 5]) = 

o 

in which it must be true that x + 1 > 0 and y — 5 < 0 in order that the argument of 
[x + 1] + j [y — 5] may be a negative acute angle. 

We thus have the half-straight-line with equation 


y - 5 



l 
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(x + 1) 


which lies to the right of, and below the point (—1, 5). 


4 



6.6.4 MORE GENERAL LOCI 


Certain types of locus problem may be encountered which cannot be identified with either 
of the two standard types discussed above. The secret, in such problems is to substitute 
z = x + jy in order to obtain the cartesian equation of the locus. We have already seen that 
this method is applicable to the two standard types anyway. 

ILLUSTRATIONS 

1. The equation 


z-1 
z + 2 


3 


may be written 


That is, 


which simplifies to 


\z-l\ — 3\z + 2|. 

(x - l) 2 + y 2 = 3[(x + 2 ) 2 + y' 2 ], 
2x 2 + 2 y 2 + 14x + 13 = 0 


or 



23 

T’ 


representing a circle with centre 1,0^ and radius 
2. The equation 


may be written 


arg 


7T 

4 


arg (z — 3) — arg z 


7 r 

4' 
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That is, 


7T 

arg([a; - 3] + jy) - avg(x + jy ) = 

Taking tangents of both sides and using the trigonometric identity for tan (A — B), we 
obtain 

_y y 

x — 3 x -i 

1 + -^- y - ~ ' 

x—6 x 

On simplification, the equation becomes 

x 2 + y 2 — 3x — 3y = 0 


or 



9 

2 ’ 


the equation of a circle with centre (|, |) and radius b . 

However, we observe that the original complex number, 

z-3 

z 

cannot have an argument of | unless its real and imaginary parts are both positive. 
In fact, 

z — 3 (x — 3) + jy x — jy x(x — 3) + y 2 + j 3 

z x + jy x — jy x 2 + y 2 

which requires, therefore, that 


That is, 


x(x — 3) + y 2 > 0. 


x 2 + y 2 — 3x > 0 


or 

3\ 2 2 9 

X ~2) +V >4' 

Conclusion 

The locus is that part of the circle with centre (l> |) and radius which lies outside 
the circle with centre (§,o) and radius |. 
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6.6.5 EXERCISES 


1. Identify the loci whose equations are 

(a) 

k — 3| =4; 

(b) 

\z-A + j7\ = 2. 

2. Identify the loci whose equations are 

(a) 

, vr 

arg {z + 1) = 

(b) 

3vr 

arg(^ — 2 — j3) = — . 

3. Identify the loci whose equations are 

(a) 

z+j2 
z-J 3 

(b) 
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6.6.6 ANSWERS TO EXERCISES 


1. (a) A circle with centre (3, 0) and radius 4; 

(b) A circle with centre 4, —7) and radius 2. 

2. (a) A half-straight-line to the right of, and above the point (—1, 0) inclined at an angle 

of | to the positive direction of the real axis; 

(b) A half-straight-line below the point (2, 3) and perpendicular to the real axis. 

3. (a) The straight line y — 

(b) That part of the circle x 2 + y 2 = 1 which lies outside the circle with centre ^ 
and radius \ and above the straight line whose equation is y = x — 1. 

Note: 

Examples like No. 3(b) are often quite difficult and will not normally be included in the 
more elementary first year courses in mathematics. 
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UNIT 7.1 - DETERMINANTS 1 


SECOND ORDER DETERMINANTS 

7.1.1 PAIRS OF SIMULTANEOUS LINEAR EQUATIONS 

The subject of Determinants may be introduced by considering, first, a set of two simulta- 
neous linear equations in two unknowns. We shall take them in the form 


a\x + biy + ci = 0, (1) 

a, 2 X + b 2 y + C2 = 0. (2) 


If we subtract equation (2) x b\ from equation (1) x b 2 , we obtain 

a\b 2 x — a 2 bix + C1&2 — c 2 b± = 0. 


Hence, 


&ic 2 - b 2 c l 
a\b 2 — ct2^i 


provided that a\b 2 — a 2 b\ 7^ 0. 

Similarly, if we subtract equation (2) x cq from equation (1) x a 2 , we obtain 


a 2biy — aib 2 y + 02 c i — Oj\c 2 — 0. 


Hence, 


Gqc 2 — o 2 Ci 

a\b 2 — a 2 b\ 


provided that a\b 2 — a 2 b\ 7^ 0. 

Note: 

Other arrangements of the solutions for x and y are possible, but the above arrangements 
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have been stated for a particular purpose which will be made clear shortly under “Observa- 
tions” . 

The Symmetrical Form of the solution 

The two separate solutions for x and y may be conveniently written in “symmetrical” form 
as follows: 


x —y 1 

b\C2 — &2C1 CI1C2 — CL2C1 Ol&2 ~~ ®2^1 


provided that a \ 62 — «2&i 7^ 0. 

7.1.2 THE DEFINITION OF A SECOND ORDER DETERMINANT 

Each of the denominators in the symmetrical form of the previous section has the same 
general appearance, namely the difference of the products of two pairs of numbers; and we 
shall rewrite each denominator in a new form using a mathematical symbol called a “second 
order determinant “ and defined by the statement: 


A B 
C D 


AD - BC. 


The symbol on the left-hand-side may be called either a second order determinant or a 
2x2 determinant; it has two “rows” (horizontally), two “columns” (vertically) and four 
“elements” (the numbers inside the determinant). 

7.1.3 CRAMER’S RULE FOR TWO SIMULTANEOUS LINEAR EQUATIONS 

The symmetrical solution to the two simultaneous linear equations may now be written 


X 


-y 


1 

b\ ci 


&l C\ 


ai bi 

b 2 c 2 


&2 c 2 


a 2 6 2 


provided that 


a 1 
®2 


61 

b>2 


7^ 0; 


or, in an abbreviated form, 


x —y 1 

Ai A 2 A 0 ’ 
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provided that A 0 ^ 0. 


This determinant rule for solving two simultaneous linear equations is called “Cramer’s 

Rule” and has equivalent forms for a larger number of equations. 

Note: 

The interpretation of Cramer’s Rule in the case when ai& 2 — a 2 &i = 0 will be dealt with as 

a special case after some elementary examples: 

Observations 

In Cramer’s Rule, 

1. The determinant underneath x can be remembered by covering up the x terms in the 
original simultaneous equations, then using the coefficients of y and the constant terms 
in the pattern which they occupy on the page. 

2. The determinant underneath y can be remembered by covering up the y terms in the 
original simultaneous equations, then using the coefficients of x and the constant terms 
in the pattern they occupy on the page. 

3. The determinant underneath 1 can be remembered by covering up the constant terms 
in the original simultaneous equations, then using the coefficients of x and y in the 
pattern they occupy on the page. 

4. The final determinant is labelled Ao as a reminder to evaluate it first; because, if it 
happens to be zero, there is no point in evaluating Ai and A 2 . 

EXAMPLES 

1. Evaluate the determinant 


Solution 

A = 7x5 — 4x (-2) = 35 + 8 = 43 
2. Express the value of the determinant 

A = - p - q 
v -q 



in terms of p and q. 
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Solution 


A = (— p) x (— q) — p x (— q) = p.q + p.q = 2 pq. 

3. Use Cramer’s Rule to solve for x and y the simultaneous linear equations 

5a: — 3 y = —3, 

2 x — y = —2. 


Solution 

We may first rearrange the equations in the form 


5x — 3y + 3 = 0, 
2x — y + 2 = 0. 


Hence, by Cramer’s Rule, 


where 


x —y 1 

Ai A 2 A 0 ’ 



-5 + 6 = 1; 


A: 


-3 3 
-1 2 


-6 + 3 = -3; 


Thus, 


A 2 


5 3 
2 2 


10 - 6 = 4. 


x = 


Ai 

An 


—3 and y 


A 2 

Aq 


-4. 
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Special Cases 

When using Cramer’s Rule, the determinant A 0 must not have the value zero. But if it does 
have the value zero, then, the simultaneous linear equations 


aix + biy + ci = 0 , ( 1 ) 

a, 2 X + b 2 y + C 2 = 0. (2) 


are such that 

a\b 2 — a 2&1 = 0. 

In other words, 

ai bi 

a 2 b 2 

which means that the x and y terms in one of the equations are proportional to the x and 
y terms in the other equation. 

Two situations may arise which may be illustrated by the following examples: 

EXAMPLES 

1. For the set of equations 

3a; — 2 y = 5, 

6a; — Ay = 10, 


A 0 = 0 but the second equation is simply a multiple of the first equation. That is, one 
of the equations is redundant and so there exists an infinite number of solutions; 
either of the variables may be chosen at random, with the remaining variable being 
expressible in terms of it. 

2. For the set of equations 

3a; — 2 y = 5, 

6a; — Ay = 7, 
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A 0 = 0 as before, but there is an inconsistency because, if the second equation is divided 
by 2, we obtain 

3a; — 2 y — 3.5, 

which is inconsistent with 

3a; — 2 y — 5. 


In this case there are no solutions at all. 


Summary of the Special Cases 

If Ao = 0, further investigation of the simultaneous linear equations is necessary. 

7.1.4 EXERCISES 


1. Write down the values of the following determinants: 



2 

3 

; (b) 

i 

-2 


-2 

3 

(a) 

1 

4 

2 

1 

; (c) 

-1 

2 

(d) 

X 

y 

y 

X 

; (e) 

X X 

y y ’ 

(f) 

a 

-b 

b 

a 


2. Use determinants (that is, ‘Cramer’s Rule’) to solve the following sets of simultaneous 
linear equations: 


19a; + 6 y = 39, ^ 3a; + Ay + 6 = 0, 

13a; -8 y = -6.’ W 5a; - 3y - 19 = 0. ’ 


(c) 


2x + 1 = 3 y, ' 


x 


- 5=7 y : 

3ii + 2i 2 = 5, _ 


(d) 

(f) 


2 y = 


x , 


7 + 3 y = 2a;. ’ 

2a; — 4 z — 5, 

■ 2z = 1. ' 


v ' *i - 3»/ 2 - 7. '’ ' ' x 

3. By expanding out all of the determinants, verify the following results: 


a i 

h 


a i 

ci 2 



bi 

di 

a 2 

b 2 


bi 

k 2 



k 2 

d 2 


(b) 


d\ 

bi 


d i + Ci 

bi 


Cl 

bi 

d 2 

b 2 


d 2 + C 2 

b 2 


c 2 

h 


a i b\ 

a 2 b 2 


d\ — kbi b\ 
a 2 - kb 2 b 2 


for any number, k. 
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7.1.5 ANSWERS TO EXERCISES 


1. The values are: 

(a) 5; (b) 5; (c) -1; 

(d) x 2 — y 2 ; (e) 0; (f) a 2 + b 2 . 


-276 621 -230’ 

hence, x = 1.2 and y = 2.7; 

(b) 

a: — y 1 

—58 “ ^87 “ —29’ 

hence, x = 2 and y = —3; 

(c) 

x —y 1 

22 “ ~~ ^IT’ 

hence, x = — 2 and y = — 1; 

(d) 

a: — y 1 

^26 ~~ 1” ~~ ^7’ 

hence, a: = y and y — 

(e) 

*1 _ -*2 _ 1 
A29 “ —16 ~~ ^11’ 

hence, = ff and i 2 = ; 

(f) 

x —z 1 

—6 — — o’ 

which means that Cramer’s rule breaks down. In fact, the second equation gives two 
contradictory statements 2x — 4z = 5 and 2a; — 4 z = 2. 


3. By following the instructions, the results may be verified. 
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UNIT 7.2 - DETERMINANTS 2 


CONSISTENCY AND THIRD ORDER DETERMINANTS 

7.2.1 CONSISTENCY FOR THREE SIMULTANEOUS LINEAR EQUATIONS 
IN TWO UNKNOWNS 

In a genuine scientific problem involving simultaneous equations, it is not necessarily true 
that there will be the same number of equations to solve as there are unknowns to be 
determined. We examine, here, an elementary situation in which there are three equations, 
but only two unknowns. 

Consider the set of equations 


aiX + biy + Ci 

a 2 x + b 2 y + c 2 
a 3 x + b 3 y + c 3 


0 . ( 1 ) 

0 , ( 2 ) 

0, (3) 


where we shall assume that any pair of the three equations has a unique common solution 
which may be calculated by Cramer’s Rule. 

In order that the three equations shall be consistent, the common solution of any pair must 
also satisfy the remaining equation. In particular, the common solution of equations (2) and 
(3) must also satisfy equation (1). 

By Cramer’s Rule in equations (2) and (3), 


X 


- 

■y 


1 

b 2 c 2 


Q'2 

c 2 


a 2 b 2 

h c 3 


a 3 

c 3 


a 3 b 3 


This solution will satisfy equation (1) provided that 


b 2 c 2 


a 2 

c 2 

b 3 c 3 

. 

a 3 

c 3 

a 2 b 2 

U 1 

a- 2 b 2 

a 3 b 3 


a 3 b 3 


In other words, 


1 



0. 


a i 


b2 

h 


c 2 

c 3 


-h 


(l 2 
03 


c 2 

c 3 


+ Cl 


a 2 

a 3 


& 2 

h 


This is the determinant condition for the consistency of three simultaneous linear equations 
in two unknowns. 

7.2.2 THE DEFINITION OF A THIRD ORDER DETERMINANT 

In the consistency condition of the previous section, the expression on the left-hand-side is 
called a “determinant of the third order” and is denoted by the symbol 


dl 

bi 

Cl 

0,2 

b 2 

C 2 

a 3 

h 

C 3 


It has three “rows” (horizontally), three “columns” (vertically) and nine “elements” 
(the numbers inside the determinant). 

The definition of a third order determinant may be stated in the form 


0\ 

bi 

Cl 

a 2 

b 2 

c 2 

a 3 

h 

c 3 



c 2 

c 3 



c 2 

c 3 


+ Cl 


a 2 

a 3 


b>2 

h 


Notes: 

(i) Other forms of the definition are also possible and will be encountered in Unit 7.3 

(ii) The given formula for evaluating a third order determinant can be remembered by 
taking each element of the first row in turn and multiplying it by the so-called “minor” 
of the element, which is the second order determinant obtained by covering up the row and 
column in which the element appears; the results are then combined according to a +, — , + 
pattern. 

(iii) For the purpose of locating various parts of a determinant, the rows are counted from 
the top to the bottom and the columns are counted from the left to the right. Each row is 
read from the left to the right and each column is read from the top to the bottom. Thus, 
for example, the third element of the second column is & 3 . 
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EXAMPLES 


1. Evaluate the determinant 


Solution 



7 

-2 . 
3 


That is, 


A = -3 


4 

-1 




4 

-1 ' 


A = -3(12 - 2) - 2(0 + 10) + 7(0 - 20) = -190. 


2. Show that the simultaneous linear equations 


3a; — y + 2 = 0, 
2a; + by — 1 = 0, 
5a; + Ay + 1 = 0 


are consistent (assuming that any two of the three have a common solution), and obtain 
the common solution. 

Solution 

The condition for consistency is that the determinant of coefficients and constants must 
be zero. We have 


3-12 
2 5-1 

5 4 1 




5 

4 


= 3(5 + 4) + (2 + 5) + 2(8 - 25) = 27 + 7 - 34 = 0. 

Thus, the equations are consistent and, to obtain their common solution, we may solve 
(say) the first two as follows: 


X 


-y 


1 

-1 2 


3 2 


3 -1 

5 -1 


2 -1 


2 5 


3 



That is, 


which gives 


x —y 1 
—9 _ —7 _ 17’ 


x = and y 

17 


7 

17 


Note: 

It may be observed that the given set of simultaneous equations above is not an independent 
set because the third equation happens to be the sum of the other two. We say that the 
equations are “linearly dependent”; and this implies that the rows of the determinant of 
coefficients and constants are linearly dependent in the same way. 

(In this case, Row 3 = Row 1 plus Row 2). 

Furthermore, it may shown that the value of a determinant is zero if and only if its rows are 
linearly dependent. Hence, an alternative way of proving that a set of simultaneous linear 
equations is a consistent set is to show that they are linearly dependent in some way. 

7.2.3 THE RULE OF SARRUS 

From the given definition of a third order determinant, the complete “expansion” of the 
determinant may be given, in general, as 


CL\ 

bi 

Cl 


b 2 

c 2 

a 3 

b 3 

c 3 


a\{b 2 c 3 - b 3 c 2 ) - bi(a 2 c 3 - a 3 c 2 ) + ci(a 2 b 3 - a 3 b 2 ) 


But it may be observed that precisely the same terms may be obtained by first constructing 
a diagram which consists of the original determinant with the first two columns written out 
again to the right of this determinant. That is: 


CL\ 

b\ 

Cl 

a i 

bi 

&2 

b 2 

c 2 

a 2 

b 2 

a 3 

b 3 

c 3 

a 3 

O- 

co 


Taking the sum of the possible products of the trios of numbers in the direction \ and 
subtracting the sum of the possible products of the trios of numbers in the direction, we 
obtain the terms 
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(aib 2 c 3 + b\c 2 a 3 + cia 2 b 3 ) - ( a 3 b 2 ci + b 2 c 3 a\ + c 3 a 2 b \ ); 


and it may be shown that these are exactly the same terms as those obtained by the original 
formula. 

This “Rule of Sarrus” makes it possible to evaluate a third order determinant with an 
electronic calculator, almost without putting pen to paper, provided the calculator memory 
is used to store, then recall, the various products. 

EXAMPLE 


-3 

2 

7 


-3 

2 

7 

0 

4 

-2 

= 

0 

4 

-2 

5 

-1 

3 


5 

-1 

3 


= ([— 3 ]. 4. 3 + 2 . [— 2], 5 + 7 . 0 . [— 1 ]) - ( 5 . 4.7 + [- 1 ], [- 2 ], [- 3 ] + 3 . 0 . 2 ) 


= (-36 - 20 + 0 ) - (140 - 6 + 0 ) = -56 - 134 = -190 
as in a previous example. 

7.2.4 CRAMER’S RULE FOR THREE SIMULTANEOUS LINEAR 
EQUATIONS IN THREE UNKNOWNS 

In the same way that, under certain conditions, two simultaneous linear equations may be 
solved by determinants of the second order, it is possible to show that, under certain condi- 
tions, three simultaneous linear equations in three unknowns may be solved by determinants 
of the third order. 

The proof of this result will not be included here , but we state it for reference. 

The simultaneous linear equations 


a i x + biy + C\Z + d\ = 0, 

a 2 x + b 2 y + c 2 z + d 2 = 0, 

a 3 x + b 3 y + c 3 z + d 3 = 0, 
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have a common solution, given in symmetrical form, by 


x —y z — 1 


bi 

Cl 

d\ 


CL\ 

Cl 

d\ 


a i 

bi 

d\ 


ai 

bi 

Cl 

b 2 

c 2 

d 2 


0,2 

c 2 

d 2 


a 2 

b 2 

d 2 


a 2 

b 2 

c 2 

CO 

c 3 

d 3 


a 3 

c 3 

d-i 


a 3 

b-3 

d,i 


a 3 

CO 

c 3 


x —y z —1 

Ai A 2 A 3 A 0 ’ 

which is called the “Key” to the solution and requires that A 0 ^ 0. 

Again the rule itself is known as “Cramer’s Rule”. 

EXAMPLE 

Using the Rule of Sarrus, obtain the common solution of the simultaneous linear equations 


Solution 

The “Key” is 


where 

i 


Hence, 


x + Ay — z + 2 = 0, 
— x — y + 2z — 9 = 0, 
2x + y — 3z + 15 = 0. 


x —y z —1 

Ai A 2 A 3 A 0 ’ 



-1 

2 

-3 


1 

-1 

2 


4 

-1 

1 
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A 0 = (3 + 16 + 1) - (2 + 2 + 12) = 20 - 16 = 4, 


which is non-zero, and so we may continue: 


ii 



-1 

2 

-3 




Hence, 


A x = (120 + 9 + 6) - (4 + 108 + 15) = 135 - 127 = 8. 


(iii) 


Hence, 



(iv) 


Hence, 


A 2 = (30 + 18 + 6) - (8 + 27 + 15) = 54 - 50 = 4. 


A3 — 


1 

-1 

2 


4 

-1 

1 



4 

-1 

1 


A 3 = (-15 - 72 - 2) - (-4 - 9 - 60) = -89 + 73 = -16. 


(v) The solutions are therefore 


_Ai = _8 = _ 2 . 
A 0 4 
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A 2 4 

V= AT 4 = 1: 


z = 


A3 

Aq 


-16 


= 4. 


Special Cases 

If it should happen that A 0 = 0 when solving a set of three simultaneous linear equations 
by Cramer’s Rule, earlier work has demonstrated that the rows of A 0 must be linearly 
dependent. That is the three groups of x, y and z terms must be linearly dependent. 

Different situations arise according to whether or not the constant terms can also be brought 
in to the linear dependence relationship and we illustrate with examples as follows: 

EXAMPLES 

1. For the simultaneous linear equations 


2x — y + 3z — 5 = 0, 

x + 2y — z — 1 = 0, 

x — 3y + Az — 4 = 0, 

the third equation is the difference between the first two and hence it is redundant. 

Any solution common to the first two equations will thus be an acceptable solution. 
In this case, there will be an infinite number of solutions since, for example, we may 
choose the variable z at random, solving for x and y to obtain 

11-5 z , 5z — 3 

x = and y = — - — . 

5 5 

2. For the simultaneous linear equations 

2x — y + 3z — 5 = 0, 

x + 2y — z — 1 = 0, 

x — 3y + Az — 7 = 0, 
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the third equation is inconsistent with the difference between the first two equations. 
That is, 


x — 3y + Az — 7 = 0 is inconsistent with x — 3y + Az — 4 = 0. 

In this case, there are no common solutions. 

3. For the simultaneous linear equations 

x — 2y + 3z — 1 = 0, 

2x — Ay + 6z — 2 = 0, 

3x — 6y + 9z — 3 = 0, 

we have only one independent equation since the second and third equations are mul- 
tiples of the first equation. 

Again, there will be an infinite number of solutions which may be obtained by choos- 
ing two of the variables at random, then determining the corresponding value of the 
remaining variable. 

Summary of the special cases 

If A 0 = 0, further investigation of the simultaneous linear equations is necessary. 

7.2.5 EXERCISES 

1. Show that the simultaneous linear equations 

x + y + 2 = 0, 

3a; + 2y — 1 = 0, 

2a; + y — 3 = 0, 

are consistent and determine their common solution. 

2. Show that the simultaneous linear equations 

7x — 2y + 1 = 0, 

3x + 2y — A = 0, 
x — 6y — 9 = 0, 

are inconsistent. 
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3. Obtain the values of A for which the simultaneous linear equations 


3x + 5y + (\ - 2) = 0, 

2x + y — 5 = 0, 

(A - l)x + 2y - 10 = 0, 

are consistent. 

4. Use Cramer’s Rule to solve, for x, y and z, the following simultaneous linear equations: 

5a; + 3y — z + 10 = 0, 

—2a; — y + 4z — 1 = 0, 

-x + 2y - 7z - 17 = 0. 

5. Show that the simultaneous linear equations 

x — y + 7z — 1 = 0, 
x + 2y — 3z + 5 = 0, 

5a; + Ay + 5z + 13 = 0 

are linearly dependent and obtain the common solution for which z = —1. 

7.2.6 ANSWERS TO EXERCISES 

1. 

x = 5 y — —7. 

2. 

A 0 ^ 0. 

3. 

A = 5 or A = -23. 

4. 

x = — 4 y = 3 z — —1. 

5. 

8 16 

x = - y — z — —1. 

3 y 3 
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UNIT 7.3 - DETERMINANTS 3 


FURTHER EVALUATION OF 
THIRD ORDER DETERMINANTS 

7.3.1 EXPANSION BY ANY ROW OR COLUMN 

For the numerical evaluation of a third order determinant, the Rule of Sarrus is the easiest 
rule to apply; but we examine here some alternative versions of the original definition formula 
which will lead us to important standard properties of determinants. 

Let us first re-state the original definition formula as follows: 


ai 

h 

Cl 

a 2 

b 2 

c 2 

a 3 

b 3 

C 3 



c 2 

c 3 



c 2 

c 3 


+ Cl 


(l 2 
a 3 


b 2 

b 3 


The question naturally arises as to whether the three elements of other rows (or even columns) 
may multiplied by their minors and the results combined in such a way as to give the same 
result as in the above formula. It turns out that any row or any column may be used in this 
way. 

In order to illustrate this fact, we state again the more algebraic formula for a third order 
determinant, namely 


CL\ 

bi 

Cl 


b 2 

c 2 

a 3 

b 3 

c 3 


a\{b 2 c 3 - b 3 c 2 ) - bi(a 2 c 3 - a 3 c 2 ) + ci(a 2 & 3 - a 3 b 2 ). 


ILLUSTRATION 1 - Expansion by the second row. 

It may be observed that the expression 


-a 2 (6ic 3 - b 3 ci) + b 2 {a\c 3 - a 3 c 3 ) - c 2 (ai& 3 - a 3 b 3 ) 
gives exactly the same result as in the original formula. 
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ILLUSTRATION 2 - Expansion by the third column. 

It may be observed that the expression 


ci(a 2 &3 - a 3 b 2 ) - c 2 (ai& 3 - a 3 bi) + c 3 (ai& 2 - a 2 b 1 ) 

gives exactly the same result as in the original formula. 

Note: 

Similar patterns of symbols give the expansions by the remaining rows and columns. 

Summary 

A third order determinant may be expanded (that is, evaluated) if we multiply each of the 
three elements in any row or (any column) by its minor then combined the results according 
the following pattern of so-called “place-signs” . 

+ - + 

- + - . 

+ - + 


Note: 

It is useful to have a special name for the “signed-minor” which any element of a deter- 
minant is mutiplied by, when expanding it by a row or a column. In fact every signed-minor 
is called a “cofactor”. This means that, wherever the place-sign is +, the minor and the 
cofactor are the same; but, wherever the place-sign is — , the cofactor is numerically equal 
to the minor but opposite in sign. 


For instance, 


(i) The minor of b\ is 


0 2 c 2 

03 C3 


but the cofactor of b\ is 


0-2 C 2 
O3 C3 


(ii) The minor and cofactor of b 2 are both equal to 


a\ 

03 


Cl 

c 3 
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7.3.2 ROW AND COLUMN OPERATIONS ON DETERMINANTS 


INTRODUCTION 

Certain types of problem in scientific work can involve determinants for which some or all of 
the elements are variable quantities rather than fixed numerical quantities. In these cases, 
the methods so far encountered for expanding a determinant may not be appropriate. 

Described below is a set of standard properties for determinants of any order but, where 
necessary, they will be explained using either 3x3 determinants or 2 x 2 determinants. 

STANDARD PROPERTIES OF DETERMINANTS 

In this section, the methods of expanding a determinant by any row or any column will be 
useful to have in mind. 


1. If all of the elements in a row or a column have the value zero, then the value of the 
determinant is equal to zero. 

Proof: 

We simply expand the determinant by the row or column of zeros. 

2. If all but one of the elements in a row or column are equal to zero, then the value of 
the determinant is the product of the non- zero element in that row or column with its 
cofactor. 

Proof: 

We simply expand the determinant by the row or column containing the single non- 
zero element; and we also notice that the determinant is effectively equivalent to a 
determinant of one order lower. 

For example, 


5 1 0 

-2 4 3 

6 8 0 


-3 


5 1 

6 8 


-3(40 - 6) = -102. 


3. If a determinant contains two identical rows or two identical columns, then the value of 
the determinant is zero. 

Proof: 

If we expand the determinant by a row or column other than the two identical ones, it 
will turn out that all of the cofactors have value zero. 

For example, 


3 



0 . 


12 3 
4 5 6 
12 3 





2 

2 


4. If two rows, or two columns, are interchanged the value of the determinant is unchanged 
numerically but it is reversed in sign. 

Proof: 

If we expand the determinant by a row or column other than the two which have been 
interchanged, then all of the cofactors will be changed in sign. 

For example, 


CL\ 

Cl 

lh 


c 2 

b 2 

a 3 

c 3 

b 3 



b 2 

h 


— a 2 


Cl 

c 3 


h 

h 


+ « 3 


Cl 

C 2 


h 

b 2 


5. If all of the elements in a row or column have a common factor, then this common factor 
may be removed from the determinant and placed outside. 

Proof: 

Expanding the determinant by the row or column which contains the common factor 
is equivalent to removing the common factor Erst, then expanding by the new row or 
column so created. 

For example, 


O] 

kb x 

Cl 


(l\ 

bi 

Cl 

a 2 

kb 2 

c 2 

= k 

a 2 

b 2 

c 2 

a 3 

kb 3 

C 3 


a 3 

b 3 

c 3 


if we expand the left- hand-side by the second column. 

Note: 

Another way of stating this property is that, if all of the elements in any row or column 
of a determinant are multiplied by the same factor, then the value of the determinant 
is also multiplied by that factor. 

6. If the elements of any row in a determinant are altered by adding to them (or subtracting 
from them) a common multiple of the corresponding elements in another row, then the 
value of the determinant is unaltered. A similar result applies to columns. 

ILLUSTRATION 

The validity of this result is easily shown in the case of 2 x 2 determinants as follows: 


a i + kb\ b\ 
a 2 + kb 2 b 2 


[(cq + kbi)b 2 - ( a 2 + kb 2 )bi] = aib 2 - a 2 b x 


a\ b\ 
a 2 b 2 
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The above properties need not normally be used for the evaluation of determinants whose 
elements are simple numerical values; but, in the examples which follow, we include one such 
determinant in order to provide a simple introduction to the technique. 

We shall use the symbols R { , R 2 and R 3 to denote Row 1, Row 2 and Row 3; the symbols C\, 
C 2 and C 3 will be used to denote Column 1, Column 2 and Column 3; and the symbol — > 
will stand for the word “becomes”. The examples use what are called “row operations” 
and “column operations”. 

EXAMPLES 


1. Evaluate the determinant, 

1 15 7 

2 25 9 

3 10 3 

Solution 

1 15 7 

2 25 9 Ci — * Ci 5; 

3 10 3 


1 3 7 

5 2 5 9 R 2 — ♦ R 2 -2Ri, 
3 2 3 


1 3 7 

5 0-1-5 R,i — ♦ R 3 — 3Ri, 
3 2 3 


1 3 7 

0 -1 -5 

0 -7 -18 


= 5(18 -35) = 5 x -17 = -85. 
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2 . Solve, for x, the equation 


a; 5 3 

5 x+1 1 =0. 

-3 -4 x-2 

Solution 

We could expand the determinant directly, but we would then obtain a cubic equation 
in x which may not be straightforward to solve. 

A better method is to try to obtain factors of this cubic equation before expanding the 
determinant. 

It may be observed in this example that the three expressions in each column add up 
to the same quantity, namely x + 2. Thus if we first add Row 2 to Row 1, then add 
Row 3 to the new Row 1, we shall obtain x + 2 as a factor of the first row. 

We may write 


X 

5 

3 


5 

x + 1 

1 i?! — 

R\ + R2 + R3 

-3 

-4 

x — 2 


x + 2 

x + 2 

x + 2 


5 

x + 1 

1 R 1 - 

— * R\ + (x + 2 ) 

-3 

-4 

x — 2 



111 

= (x + 2) 5 x + 1 1 C 2 — * C2 — Ci and C 3 — ■> C 3 — C\ 

-3 -4 x-2 

1 0 0 
= (x + 2) 5 x — 4 —4 

-3 -1 x + 1 


Hence, 


= (x + 2)[(x — 4)(x + 1) — 4] = (x — 2)(x 1 2 — 3x — 8 ). 


x = — 2 or x 



3 ± 

2 
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3. Solve, for x, the equation 


x — 6 —6 x — 5 

2 x + 2 1 = 0 , 

7 8 x + 7 

Solution 

We may observe that the sum of the corresponding pairs of elements in the first two 
rows is the same, namely x — 4. Hence we may proceed as follows: 

x — 6 —6 x — 5 

0= 2 x + 2 1 R\ — * Rt + Rz 

7 8 x + 7 

x — 4 x — 4 x — 4 

2 x + 2 1 Rj — * Rj_ + (x - 4) 

7 8 x + 7 

11 1 

= (x - 4) 2 x + 2 1 C 2 — ♦ C 2 - Ci and C 3 — » C 3 - C\ 

7 8 x + 7 

1 0 0 
= (x — 4) 2 x —1 
7 1 x 


= (x — 4)(x 1 2 + 1). 

In this case, the only real solution is x = 4, the others being complex numbers x — +j. 
4. Solve, for x, the equation 

x 3 2 

4 x + 4 4 , 

2 1 x — 1 

Solution 

We may observe that the 2 in Row 1 may be used to reduce to zero the 4 underneath 
it in Row 2. 
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Hence, 


a; 3 2 

0=4 x + 4 4 R -2 — > ^2-2^1 

2 1 x- 1 


a: 3 2 

= 4 -2a: x - 2 0 i? 2 — » R 2 + (x - 2) 

2 1 x - 1 


a; 3 2 

= (x - 2) -2 1 0 Ci — * Ci + 2C 2 

2 1 x-1 


x + 6 3 2 

= (x - 2) 0 1 0 

4 1 x — 1 


= (x — 2)[(x + 6)(x — 1) — 8] = (x — 2)[x 2 + 5x — 14] = (x — 2)(x + 7)(x — 2). 
Thus, 

x = 2 (repeated) and x = —7. 


Note: 

It is not possible to cover, by examples, every type of problem which may occur. The secret 
is first to spend a few seconds examining whether or not the sum or difference of a group of 
rows or columns can give a common factor immediately. If not, the procedure is to look for 
ways of obtaining a row or column in which all but one of the elements is zero and hence, 
effectively, to reduce the order of the determinant. 
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7.3.3 EXERCISES 


1. Use row and/or column operations to evaluate the following determinants: 
(a) 


100 

101 

102 

101 

102 

103 

102 

103 

104 


(b) 

1! 2! 3! 

2! 3! 4! . 

3! 4! 5! 

2. Use row and/or column operations to evaluate, in terms of a and 6, the determinant 

11 1 

1 1+a 1 . 

1 1 1 + 6 


3. Show that the equation 


x a b 
a x b 
a b x 


0 


has one solution x = — (a + 6) and hence solve it completely. 


4. Solve completely, for x, the following equations: 


(a) 


(b) 


x — 3 

x + 2 

x — 1 


x + 2 

x — 4 

X 

= 0 

x — 1 

x + 4 

x — 5 



x + 1 x + 2 3 

2 x + 3 x + 1 
x + 3 1 x + 2 


0 . 
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7.3.4 ANSWERS TO EXERCISES 


1. (a) 

0 

(b) 

24 

2 . 


ab 


3. 

x = — (a + b), x = a, x = b. 

4. (a) 

x = ^ only; 

(b) 


x = —3, x = ±V 3. 
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UNIT 7.4 - DETERMINANTS 4 


HOMOGENEOUS LINEAR EQUATIONS 

7.4.1 TRIVIAL AND NON-TRIVIAL SOLUTIONS 

This Unit is concerned with a set of simultaneous linear equations in which all of the constant 
terms have value zero. Most of the discussion will involve three such “homogeneous” linear 
equations of the form 


a\x + biy + c\z = 0 , 
a 2 x + b 2 y + c 2 z = 0, 
a 3 x + b 3 y + c 3 z = 0. 


These could have been discussed at the same time as Cramer’s Rule but are worth con- 
sidering as a completely separate case since, in scientific applications, they lend themselves 
conveniently to the methods of row and column operations. 

Observations 

1. In Cramer’s Rule for the above set of equations, if the determinant, A 0 of the coefficients 
of x, y and z is non-zero, there will exist a unique solution, namely x = 0, y = 0, z = 0, 
since each of the determinants Ai, A2 and A3 will contain a column of zeros (that is, 
the constant terms of the three equations). 

But this solution is obvious from the given set of equations and we call it the “trivial 
solution” . 

2. The question arises as to whether it is possible for the set of equations to have any 

“non-trivial” solutions. 

3. We shall see that non-trivial solutions occur when the number of equations reduces to 
less than the number of variables being solved for; that is when the equations are not 
linearly independent. 

For example, if one of the equations is redundant, we could solve the remaining two in 
an infinite number of ways by choosing one of the variables at random. Also, if two 
of the equations are redundant, we could solve the remaining equation in an infinite 
number of ways by choosing two of the variables at random. 

4. It is evident from previous work that the set of homogeneous linear equations will have 
non-trivial solutions provided that 


1 



«1 

bi 

Cl 

a 2 

b -2 

c 2 

0,3 

O* 4 

CO 

c 3 


5. Once it has been established that non-trivial solutions exist, it can be seen that any 
solution x = a, y = P, z = j will imply other solutions of the form x = Aa, y = \/ 3 , 
z = Ay, where A is any non-zero number. 


TYPE 1 - One of the three equations is redundant 

The non-trivial solutions to a set which reduces to two linearly independent homogeneous 
linear equations in x, y and z may be stated in the form 


x : y : z — a : P : 7, 


in which we mean that 


x 

y 


a y 
P' z 



X 

z 


a 

1 


One method of obtaining these ratios is first to eliminate z between the two equations in 
order to obtain the ratio x : y, then to eliminate y between the two equations in order to 
find the ratio x : z\ but a slightly simpler method is described in the first worked example 
to be discussed shortly. 

TYPE 2 - Two of the three equations are redundant 

This case arises when the three homogeneous linear equations are multiples of one another. 

Again, any solution implies an infinite number of others in the same set of ratios, x : y : z. 
But it turns out that not all solutions are in the same set of ratios. 

For example, if the only equation remaining is 


ax + by + cz = 0, 

we could choose any two of the variables at random and solve for the remaining variable. 
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In particular, we could substitute y — 0 to obtain x : y : z = : 0:1; 

and we could also substitute z = 0 to obtain x : y : z — : 1:0 

From these two, it is now possible to generate solutions with any value, /?, of y and any 
value, 7, of z (as if we had chosen y and z at random) in order to solve for x. 

In fact 

b c 

x = -(/3.-+ 7.-), y = /3, z = 7. 
a a 


Note: 

It may be shown that, for a set of homogeneous linear simultaneous equations, no types of 
solution exist other than those discussed above. 

EXAMPLES 

1. Show that the homogeneous linear equations 


2 x + y — z = 0, 
x — 3y + 2z = 0, 
x + Ay — 3z = 0 


have solutions other than a; = 0, y = 0, z = 0 and determine the ratios x : y : z for 
these non-trivial solutions. 

Solution 


An — 


2 1 
1 -3 
1 4 


2 1 

1 -3 = (18 + 2 

1 4 


4) -(3 + 16 -3) = 0. 


Thus, the equations are linearly dependent and, hence, have non-trivial solutions. 

Note: 

We could, alternatively, have noticed that the first equation is the sum of the second 
and third equations. 


3 



(b) It can always be arranged, in a set of ratios a : f3 : 7, that any of the quantities 
which does not have to be equal to zero may be given the value 1. For example, ^ ^ : 1 

is the same set of ratios as long as 7 is not zero. 

Let us now suppose that z = 1, giving 

2x + y — 1 = 0, 
x — 3y + 2 = 0, 
x + Ay — 3 = 0 

On solving any pair of these equations, we obtain x = | and y — |, which means that 

1 5 1 

x : y : z — - : - : 1 

7 7 

That is, 

a; : y : z = 1 : 5 : 7 

and any three numbers in these ratios form a solution. 

2. Determine the values of A for which the homogeneous linear equations 


(1 — \)x + y — 2z = 0, 
-x + (2 - A)y + z = 0, 
y-(l-X)z = 0 


have non-trivial solutions. 

Solution 

First we solve the equation 


1 - A 1 -2 

0 = |-1 2 - A 1 

0 1 — 1 — A) 


Ri — » R\ — i?3 


1 — A 0 — 1 + A 


-1 2 - A 


1 

—1 — A 


R 1 — > R 1 4- (1 - A) 
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10-1 

= (1 — A) — 1 2 — A 1 ^3 — Cg + Ci 

0 1 -1 - A 


1 0 0 

= (1- A) -1 2- A 0 

0 1 -1 - A 



3a; — 7y + z = 0. 

Solution 

Substituting y = 0, we obtain 3a; + z = 0 and hence x : y : z — — | : 0 : 1. 

Substituting z = 0, we obtain 3a; — 7y = 0 and hence x : y : z = | : 1 ; 0 
The general solution may thus be given by 

7/^7 a 

x = . y = p, z = 7 

3 3 , y / 

for arbitrary values of f3 and 7 , though other equivalent versions are possible according 
to which of the three variables are chosen to have arbitrary values. 
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7.4.2 EXERCISES 


1. Show that the homogeneous linear equations 

x — 2y + 2z = 0, 

2a; — 2 y — z = 0, 

3 x + y + z = 0 

have no solutions other than the trivial solution. 

Show that the following sets of homogeneous linear equations have non-trivial solutions 
and express these solutions as a set of ratios for x : y : z 

(a) 

x — 2y + z = 0, 
x + y — 3z = 0, 

3a; — 3 y — z = 0; 

(b) 

3x + y — 2z = 0, 

2x + Ay + 2z = 0, 

Ax + 3y — z = 0. 

Determine the values of A for which the homogeneous linear equations 

Xx + 2y + 3z = 0, 

2a; + (A + 3 )y + 6 z = 0, 

3a; + Ay + (A + 6 )z = 0 

have non-trivial solutions and solve them for the case when A is an integer. 

4. Determine the general solution to the homogeneous linear simultaneous equations 

(A + A)x — 5y + 3^ = 0, 

—2a; + (A — 8 )y + 6z = 0, 

—3a; - 15y + (A + ll)z = 0 

in the case when A = —2. 


2 . 
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7.4.3 ANSWERS TO EXERCISES 


1 . 


2. (a) 


(b) 


3. 


A 0 ^ 0. 


x : y : z = 5:4:3; 
x : y : z — 1 : — 1 : 1 
A = 1, 0.83 or - 10.83 


When A = 1, x : y : z — — 1 : — 1 : 1 
4. 


£ = - 5/3 + 37 , V = P, z = r ). 
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UNIT 8.1 - VECTORS 1 - INTRODUCTION TO VECTOR ALGEBRA 


8.1.1 DEFINITIONS 

1. A “scalar” quantity is one which has magnitude, but is not related to any direction in 
space. 

Examples: Mass, Speed, Area, Work. 

2. A “vector” quantity is one which is specified by both a magnitude and a direction in 
space. 

Examples: Velocity, Weight, Acceleration. 

3. A vector quantity with a fixed point of application is called a “position vector” . 

4. A vector quantity which is restricted to a fixed line of action is called a “line vector” . 

5. A vector quantity which is defined only by its magnitude and direction is called a “free 
vector” . 

Note: 

Unless otherwise stated, all vectors in the remainder of these units will be free vectors. 

6. A vector quantity can be represented diagramatically by a directed straight line segment 
in space (with an arrow head) whose direction is that of the vector and whose length 
represents its magnitude according to a suitable scale. 

7. The symbols a, b, c, will be used to denote vectors with magnitudes a,b,c.... but 

it is sometimes more convenient to use a notation such as AB which means the vector 
represented by the line segment drawn from the point A to the point B. 

Notes: 

(i) The magnitude of the vector AB . which is the length of the line AB can also be 
denoted by the symbol | AB |. 

(ii) The magnitude of the vector a, which is the number a, can also be denoted by the 
symbol |a|. 

B 



8. A vector whose magnitude is 1 is called a “unit vector” and the symbol a denotes a 
unit vector in the same direction as a. A vector whose magnitude is zero is called a 
“zero vector” and is denoted by O or O. It has indeterminate direction. 

9. Two (free) vectors a and b are said to be “equal” if they have the same magnitude 
and direction. 

Note: 

This means that two directed straight line segments which are parallel and equal in 
length may be regarded as representing exactly the same vector. 


1 



10. A vector whose magnitude is that of a but with opposite direction is denoted by — a. 

8.1.2 ADDITION AND SUBTRACTION OF VECTORS 

Students may already know how the so-called “resultant” (or sum) of particular vectors, 
like forces , can be determined using either the “Parallelogram Law” or alternatively the 
“Triangle Law” . This previous knowledge is not essential here because we now define the 
sum of two arbitrary vectors diagramatically using either a parallelogram or a triangle. This 
will then lead also to a definition of subtraction for two vectors. 



Notes: 

(i) The Triangle Law is more widely used than the Parallelogram Law because of its sim- 
plicity. We need to observe that a and b describe the triangle in the same sense while 

a + b describes the triangle in the opposite sense. 

(ii) We define subtraction for vectors by considering that 

a — b = a + (— b). 

For example, to find a — b for the vectors a and b below, 




we may construct the following diagrams: 
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The third figure shows that, to find a — b, we require that a and b describe the triangle in 
opposite senses while a — b describes the triangle in the same sense as b 

(iii) The sum of the three vectors describing the sides of a triangle in the same sense is always 
the zero vector. 

8.1.3 MULTIPLICATION OF A VECTOR BY A SCALAR 

If m is any positive real number, ma is defined to be a vector in the same direction as a, but 
of m times its magnitude. 

Similarly -ma is a vector in the opposite direction to a, but of m times its magnitude. 

Note: 

a = aa and hence 

1 

— — ci. 

a 

That is, if any vector is multiplied by the reciprocal of its magnitude, we obtain a unit vector 
in the same direction. This process is called “normalising the vector” . 

8.1.4 LAWS OF ALGEBRA OBEYED BY VECTORS 

(i) The Commutative Law of Addition 


a + b = b + a. 

(ii) The Associative Law of Addition 

a + (b + c) = (a + b) + c = a + b + c. 

(iii) The Associative Law of Multiplication by a Scalar 

m(n&) = (ran) a = rona. 
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(iv) The Distributive Laws for Multiplication by a Scalar 


(m + n) a = ma + na 

and 

m( a + b) = ma + mb. 

8.1.5 VECTOR PROOFS OF GEOMETRICAL RESULTS 

The following examples illustrate how certain geometrical results which could be very cum- 
bersome to prove using traditional geometrical methods can be much more easily proved 
using a vector method. 

EXAMPLES 

1. Prove that the line joining the midpoints of two sides of a triangle is parallel to the 
third side and equal to half of its length. 

Solution 


C 



By the Triangle Law, 


BC = BA + AC 


and 

MN = MA + AN = -BA + -AC. 
^ 2 2 

Hence, 

MN = ^ (BA + AC) = ^BC, 

which proves the result. 
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2. ABCD is a quadrilateral (four-sided figure) and E,F,G,H are the midpoints of AB, BC, 
CD and DA respectively. Show that EFGH is a parallelogram. 

Solution 



By the Triangle Law, 


EF = EB + BF = 1 AB + -BC = 1 (AB + BC) = 1 AC 
— — — 2 2 2 v ’ 2 

and also 

HG = HD + DG = ^AD + ^DC = ^ (AD + DC) = ^AC. 


Hence, 


EF = HG. 


which proves the result. 
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8.1.6 EXERCISES 


1. Which of the following are vectors and which are scalars ? 
(a) Kinetic Energy; (b) Volume; (c) Force; 

(d) Temperature; (e) Electric Field; (f) Thrust. 

2. Fill in the missing arrows for the following vector diagrams: 



3. ABODE is a regular pentagon with centre O. Use the Triangle Law of Addition to show 
that 

AB + BO + CD + DE + EA = O. 

4. Draw to scale a diagram which illustrates the identity 

4a + 3(b — a) = a + 3b. 

5. a, b and c are any three vectors and 

P = b + c — 2a, q = c + a — 2b, r = 3c — 3b. 

Show that the vector 3p — 2q is parallel to the vector 5p — 6q + r. 
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8.1.7 ANSWERS TO EXERCISES 


1. (a) Scalar; (b) Scalar; (c) Vector: 

(d) Scalar; (e) Vector; (f) Vector. 

2. The completed diagrams are as follows: 



3. Join A,B,C,D and E up to the centre, O. 

4. The diagram is 
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UNIT 8.2 - VECTORS 2 - VECTORS IN COMPONENT FORM 


8.2.1 THE COMPONENTS OF A VECTOR 

The simplest way to define a vector in space is in terms of unit vectors placed along the 
axes Ox, 0 y and Oz of a three-dimensional right-handed cartesian reference system. These 
unit vectors will be denoted respectively by i, j and k (omitting, for convenience, the “bars” 
underneath and the “hats” on the top). 

Consider the following diagram: 



In the diagram, OA = i, OB = j and OC = k. P is the point with co-ordinates (x, y, z). 
By the Triangle Law 

r = OP = QQ. + QP = OR + RQ + QP. 


That is, 


r = x\ + |/j + z k. 


Notes: 

(i) The fact that we have considered a vector which emanates from the origin is not a special 
case since we are dealing with free vectors. Nevertheless OP is called the position vector of 
the point P. 

(ii) The numbers x, y and z are called the “components” of OP (or of any other vector in 
space with the same magnitude and direction as OP ). 

(iii) To multiply (or divide) a vector in component form by a scalar, we simply multiply 
(or divide) each of its components by that scalar. 
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8.2.2 THE MAGNITUDE OF A VECTOR IN COMPONENT FORM 

Referring to the diagram in section 8.2.1, Pythagoras’ Theorem gives 
(OP) 2 = (OQ) 2 + (QP) 2 = (OR) 2 + (RQ) 2 + (QP) 2 . 

That is, 

r = |o;i + yj + zk\ = \Jx 2 + y 2 + z 2 . 

EXAMPLE 

Determine the magnitude of the vector 

a = 5i — 2j + k 

and hence obtain a unit vector in the same direction. 

Solution 

|a| = a = ^5 2 + (-2) 2 + l 2 = v^. 

Hence, a unit vector in the same direction as a is obtained by normalising a; that is, dividing 
it by its own magnitude. 

The required unit vector is 

^ 1 5i — 2j + k 

cl — — .cl — . . 

a V30 

8.2.3 THE SUM AND DIFFERENCE OF VECTORS IN COMPONENT FORM 

We consider, first, a situation in two dimensions where two vectors are added together. 


y 



In the diagram, suppose Pi has co-ordinates (aq, yi) and suppose P 2 has co-ordinates (x 2 , |/ 2 ). 


2 



Then, since the triangle ORPi has exactly the same shape as the triangle P 2 SQ, the 
co-ordinates of Q must be (aq + X2, '])\ + 1/2) ■ 

But, by the Parallelogram Law, OQ is the sum of OPi and OP 2 . 

That is, 

(®ii + 2 /ij) + (x 2 i + y 2 j) = {xi + x 2 )i + ( 2/1 + y 2 ) j, 

showing that the sum of two vectors may be found by adding together their separate com- 
ponents. 

It can be shown that this result applies in three dimensions also and that, to find the 
difference of two vectors, we calculate the difference of their separate components. 

EXAMPLE 

Two points P and Q in space have cartesian co-ordinates (—3, 1, 4) and (2, —2, 5) respectively. 
Determine the vector PQ. 

Solution 

We are given that 

OP = — 3i + j + 4k and OQ = 2i — 2j + 5k. 



Q 


o 


By the triangle Law, 


PQ = OQ — OP = 5i — 3i + k. 


Note: 

The vector PQ is, of course, the vector drawn from the point P to the point Q and it may 
seem puzzling that the result just obtained appears to be a vector drawn from the origin to 
the point (5, —3, 1). However, we need to use again the fact that we are dealing with free 
vectors and the vector drawn from the origin to the point (5, —3, 1) is parallel and equal in 
length to PQ; in other words, it is the same as PQ. 
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8.2.4 THE DIRECTION COSINES OF A VECTOR 


Suppose that 


OP = r = x\ + yj + zk 


and suppose that OP makes angles a , (3 and 7 with Ox , O y and Oz respectively. 
Then, 


cosa = 


x 

r 


cos / 3 


- and cos 7 
r 


z 


r 



The three quantities cosa, cos [3 and cos 7 are called the “direction cosines” of r. 

Any three numbers in the same ratio as the direction cosines are said to form a set of 
“direction ratios” for the vector r and we note that x : y : z is one possible set of direction 
ratios. 

EXAMPLE 

The direction cosines of the vector 

6i + 2j — k 


V3T V4l V4I’ 

since the vector has magnitude y / 36 + 4 + 1 = a/4I. 

A set of direction ratios for this vector are 6 : 2 : — 1. 
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8.2.5 EXERCISES 


1. The position vectors of two points P and Q are, respectively, 

ri = 2i + 3j — k and r 2 = 4i — 3j + 2k. 

Determine the vector PQ in terms of i , j and k and hence obtain 

(a) its magnitude; 

(b) its direction cosines. 

2. Obtain a unit vector which is parallel to the vector a = 3i — j + 5k. 

3. If a = 3i — j — 4k, b = — 2i + 4j — 3k and c = i + 2j — k, determine the following: 

(a) 

2a — b + 3c; 

(b) 

| a 4" b T c | ; 

(c) 

1 3a — 2b + 4c|; 

(d) a unit vector which is parallel to 

3a — 2b + 4c. 

4. Prove that the vectors a = 3i + j — 2k, b = — i + 3j + 4k and c = 4i — 2j — 6k can form 
the sides of a triangle. 

Determine also the lengths of the “medians” of this triangle (that is, the lines joining 
each vertex to the mid-point of the opposite side). 
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8.2.6 ANSWERS TO EXERCISES 


1. PQ = 2i - 6j + 3k. 

(a) |PQ| = 7; 

(b) The direction cosines are — I and |. 

2 . 



3i - j + 5k 
y/35 • 

3. (a) 

Hi - 8k; 

(b) 

V93] 

(c) 

V398] 

(d) 

, 17i - 3j - 10k 
V398 

4. a = b + c; 

therefore the vectors form a triangle. 


The medians have lengths equal to 


5 


\J'^ ■ V6 and 


a/114 

2 
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UNIT 8.3 - VECTORS 3 


MULTIPLICATION OF ONE VECTOR BY ANOTHER 
8.3.1 THE SCALAR PRODUCT (or “Dot” Product) 

DEFINITION 

The “Scalar Product” of two vectors a and b is defined as abcos#, where # is the angle 
between the directions of a and b, drawn so that they have a common end-point and are 
directed away from that point. The Scalar Product is denoted by a • b so that 


a • b = ab cos 9 



0 < 9 < 7T 


Scientific Application 

If b were a force of magnitude b, then bcos 9 would be its resolution (or component) along 
the vector a. Hence, a • b would represent the work done by b in moving an object along the 
vector a. Similarly, if a were a force of magnitude a, then acos 9 would be its resolution (or 
component) along the vector b. Hence, a • b would represent the work done by a in moving 
an object along the vector b. 

8.3.2 DEDUCTIONS FROM THE DEFINITION OF DOT PRODUCT 

(i) a • a = a 2 . 

Proof: 

Clearly, the angle between a and itself is zero so that 

a« a = a. a cos 0 = a 2 . 

(ii) a • b can be interpreted as the magnitude of one vector times the perpendicular projection 
of the other vector onto it. 

Proof: 

bcos# is the perpendicular projection of b onto a and acos# is the perpendicular projection 
of a onto b. 
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(iii) a • b = b • a. 

Proof: 

This follows since abcos# = bacos#. 

(iv) Two non-zero vectors are perpendicular if and only if their Scalar Product is zero. 

Proof: 

a is perpendicular to b if and only if the angle 9 — |; that is, if and only if cos 9 = 0 and 
hence, abcos# = 0. 

(v) a • (b + c) = a • b + a • c. 



The result follows from (ii) since the projections PR and PQ of b and c respectively onto a 
add up to the projection PS of b + c onto a. 

Note: 

We need to observe that RS is equal in length to PQ. 

(vi) The Scalar Product of any two of the standard unit vectors i, j and k is given by the 
following multiplication table: 


• 

i 

j 

k 

i 

1 

0 

0 

j 

0 

1 

0 

k 

0 

0 

1 


That is, i • i = 1, j • j = 1 and k • k = 1; 
but, 

i • j = 0, i • k = 0 and j • k = 0. 
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8.3.3 THE STANDARD FORMULA FOR DOT PRODUCT 


If 

then 


a = Oii + a 2 j + a 3 k and b = &ii + b 2 j + 63 k, 
a • b = aibi + a 2 b 2 + a 3 b 3 . 


Proof: 

This result follows easily from the multiplication table in (vi). 
Note: The angle between two vectors 

If 9 is the angle between the two vectors a and b, then 


cos 9 


a • b 
ab 


Proof: 

This result is just a restatement of the original definition of a Scalar Product. 


EXAMPLE 

If 

a = 2i + 2j — k and b = 3j — 4k. 


then, 


Hence, 


a*b = 2x0 + 2x3 + (-1) x (-4) = 10. 


cos 9 


10 

V2 2 + 2 2 + 1 V3 2 + 4 2 


10 _ 2 
15 “ 3' 


Thus, 


9 = 48.19° or 0.84 radians. 
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8.3.4 THE VECTOR PRODUCT (or “Cross” Product) 
DEFINITION 


If 9 is the angle between two vectors a and b, drawn so that they have a common end-point 
and are directed away from that point, then the “Vector Product” of a and b is defined 
to be a vector of magnitude 

ab sin 9, 

in a direction which is perpendicular to the plane containing a and b and in a sense which 
obeys the “right-hand-thread screw rule” in turning from a to b. The Vector Product 
is denoted by 

a x b. 



0 < 9 < n 


Scientific Application 

Consider the following diagram: 



Suppose that the vector OA = a represents a force acting at the point O and that the vector 
OB = b is the position vector of the point B. Let the angle between the two vectors be 9. 

Then the “moment” of the force OA about the point B is a vector whose magnitude is 

ab sin 9 

and whose direction is perpendicular to the plane of O, A and B in a sense which obeys the 
right-hand-thread screw rule in turning from OA to OB . That is 

Moment = a x b. 


4 



Note: 

The quantity b sin 9 is the perpendicular distance from the point B to the force OA . 

8.3.5 DEDUCTIONS FROM THE DEFINITION OF CROSS PRODUCT 


(i) 


a x b = — (b x a) = (— b) x a = b x (—a). 


Proof: 

This follows easily by considering the implications of the right-hand-thread screw rule. 

(ii) Two vectors are parallel if and only if their Cross Product is a zero vector. 

Proof: 

Two vectors are parallel if and only if the angle, 9, between them is zero or n. In either case, 
sin 9 = 0, which means that ab sin 9 = 0; that is, |a x b| = 0. 

(iii) The Cross Product of a vector with itself is a zero vector. 

Proof: 

Clearly, the angle between a vector, a, and itself is zero. Hence, 

la x a| = a. a. sin 0 = 0. 


(iv) 


ax (b + c) = axb + axc. 


Proof: 

This is best proved using the standard formula for a Cross Product in terms of components 
(see 8.3.6 below). 

(v) The multiplication table for the Cross Products of the standard unit vectors i, j and k 
is as follows: 


X 

i 

j 

k 

i 

O 

k 

-j 

j 

- k 

O 

i 

k 

j 

— i 

O 


That is, 

i x i = O, j x j = O, k x k = O, ixj = k, jxk = i, kxi=j,jxi = — k, 
k x j = — i and i x k = — j. 

8.3.6 THE STANDARD FORMULA FOR CROSS PRODUCT 


If 

a = a i i + ci 2 j + 03 k and b = b \ i + b 2 } + 63 k, 


then, 


a x b = (a 2 &3 - 03 ^ 2)1 - (ai&3 - fl3&i)j + (ai &2 - a 2 bi)\i. 
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This is usually abbreviated to 


a x b = 


i j k 

0. i CL 2 O3 

bi b 2 b 3 


the symbol on the right hand side being called a “determinant” (see Unit 7.2). 

EXAMPLES 


1. If a = 2i + 2j — k and b = 3 j — 4k, determine a x b. 

Solution 


a x b = 


1 J k 

2 2-1 
0 3-4 


= (-8 + 3)i - (-8 - 0)j + (6 - 0)k = -5i + 8j + 6k. 


2. Show that, for any two vectors a and b, 

(a + b) x (a — b) = 2(b x a) 


Solution 

The left hand side = 
That is, 


axa — axb + bxa — bxb. 


0 + bxa + bxa = 2(bxa). 

3. Determine the area of the triangle defined by the vectors 

a = i + j + k and b = 2i — 3j + k. 

Solution 

If 9 is the angle between the two vectors a and b, then the area of the triangle is 
f ab sin 9 from elementary trigonometry. The area is therefore given by 


-la x b|. 
2 1- -1 


That is, 


Area = - 
2 


i j k 

111 
2-3 1 


= 2 l 4i + J “ 5 k l' 


This gives 


Area = -V16 + 1 + 25 = -V42 ~ 3.24 
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8.3.7 EXERCISES 


1. In the following cases, evaluate the Scalar Product a • b and hence determine the angle, 
6 between a and b: 

(a) a = 2i + 2j — k and b = 3i — 6j + 2k; 

(b) a = i + 2j — k and b = 3j + k; 

(c) a = — i — j + 4k and b = 7i — 2k. 

2. Find out which of the following pairs of vectors are perpendicular and determine the 
cosine of the angle between those which are not: 

(a) 3j and 2j — 2k; 

(b) i + 3j — 5k and — i + 2j + k; 

(c) 2i + 10k and 7j; 

(d) 2i + 2j — k and 6i — 3j + 2k. 

3. If a = 2i + j + k, b = i — 2j — 2k and c = 3i — 4j + 2k, determine the length of the 
projection of a + c onto b. 

4. If a = 2i — 3j + 5k and b = 3i + j — 2k, evaluate 

(a + b) • (a — b) . 

5. Determine the components of the vector a x b in the following cases: 

(a) a = 2i + 2j — k and b = 3i — 6j + 2k; 

(b) a = i + 2j — k and b = 3j + k; 

(c) a = — i — j + 4k and b = 7i — 2k. 

6. If a = 3i — j + 2k and b = i + 3j — 2k, show that a x b is perpendicular to the vector 
c = 9i + 2j + 2k. 

7. Given that a x b is perpendicular to each one of the vectors a and b, determine a unit 
vector which is perpendicluar to each one of the vectors a = 2i — j + k and b = 3i + 

4j - k. 

Calculate also the sine of the angle, 9 , between a and b. 

8. Determine the area of the triangle whose vertices are the points A(3, — 1,2), 

B(l, -1,-3) and C(4, — 3,1) in space. State your answer correct to two places of 
decimals. 
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8.3.8 ANSWERS TO EXERCISES 


1. (a) Scalar Product = —8, cos 9 ~ —0.381 and 6 ~ 112.4°; 

(b) Scalar Product = 5, cos 9 ~ 0.645 and 9 ~ 49.80°; 

(c) Scalar Product = —15, cosd ~ —0.485 and 6 ~ 119.05° 

2. (a) Cosine ~ 0.707; 

(b) The vectors are perpendicular; 

(c) The vectors are perpendicular; 

(d) Cosine ~ 0.190 

3. The length of the projection is |. 

4. The value of the Dot Product is 24. 

5. (a) The components are —2, —7, —18; 

(b) The components are 5, —1, 3; 

(c) The components are 2, 26, 7. 


6. Show that (a x b) • c = 0. 

7. A unit vector is 


and 


— 3i + 5j + Ilk 
1 a/155 


sin 0 


a/I55 

V6.V26 


~ 0.997 


The area is 6.42 
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UNIT 8.4 - VECTORS 4 
TRIPLE PRODUCTS 
INTRODUCTION 

Once the ideas of scalar (dot) product and vector (cross) product for two vectors has been 
introduced, it is then possible to consider certain products of three or more vectors where, 
in some cases, there may be a mixture of scalar and vector products. 

8.4.1 THE TRIPLE SCALAR PRODUCT 

DEFINITION 1 

Given three vectors a, b and c, expressions such as 

a • (b x c) , b • (c x a) , c • (a x b) 
or 

(a x b) • c, (b x c) • a, (c x a) • b 

are called “triple scalar products” because their results are all scalar quantities. Strictly 
speaking, the brackets are not necessary because there is no ambiguity without them; that 
is, it is not possible to form the vector product of a vector with the result of a scalar product. 

In the work which follows, we shall take a • (b x c) as the typical triple scalar product. 

The formula for a triple scalar product 

Suppose that 


a = Gqi + a 2 j + a 3 k, b = &ii + b 2 j + 63k and c = cii + c 2 j + c 3 k. 

Then, 

i j k 

a • (b x c) = (cqi + a 2 j + a 3 k) • b\ b 2 & 3 . 

Cl c 2 c 3 
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From the basic formula for scalar product, this becomes 

a i a 2 03 

h b 2 b 3 

ci C2 c 3 

Notes: 

(i) From Unit 7 . 3 , if two rows of a determinant are interchanged, the determinant remains 
unchanged in numerical value but is altered in sign. 

Hence, 

a • (b x c) = —a • (c x b) = c • (a x b) = — c • (b x a) = b • (c x a) = — b • (axe). 

In other words, the “cyclic permutations” of a • (bxc) are all equal in numerical value 
and in sign, while the remaining permutations are equal to a • (bxc) in numerical value, 
but opposite in sign. 

(ii) The triple scalar product, a • (bxc), is often denoted by [a, b, c]. 

EXAMPLE 

Evaluate the triple scalar product, a • (bxc), in the case when 

a = 2i + k, b = i + j + 2k and c = — i + j 

Solution 

2 0 1 

a • (b x c) = 1 1 2 = 2.(-2) - 0.(2) + 1.(2) = -2. 

-110 

A geometrical application of the triple scalar product 

Suppose that the three vectors a, b and c lie along three adjacent edges of a parallelepiped 
(correct pronunciation, “parallel- epi-ped” ) as shown in the following diagram: 


a • (b x c) = 


2 



b x c 

b 

The area of the base of the parallelepiped, from the geometrical properties of vector products, 
is the magnitude of the vector, bxc, which is perpendicular to the base. 

The perpendicular height of the parallelepiped is the projection of the vector a onto the 
vector bxc; that is, 

a • (bxc) 

|b x c| 

ffence, since the volume, V, of the parallelepiped is equal to the area of the base times the 
perpendicular height, we conclude that 

V = a • (bxc), 

at least numerically, since the triple scalar product could turn out to be negative. 

Note: 

The above geometrical application also provides a condition that three given vectors, a, b 
and c lie in the same plane; that is, they are “coplanar”. 

The condition is that 

a • (b x c) = 0, 

since the three vectors would determine a parallelepiped whose volume is zero. 
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8.4.2 THE TRIPLE VECTOR PRODUCT 


DEFINITION 2 

If a, b and c are any three vectors, then the expression 

a x (b x c) 

is called the “triple vector product” of a with b and c. 

Notes: 

(i) The triple vector product is clearly a vector quantity. 

(ii) The inclusion of the brackets in a triple vector product is important since it can be shown 
that, in general, 


ax(bxc)/(axb)xc. 


For example, if the three vectors are considered as position vectors, with the origin as a 
common end-point, then a x (b x c) is perpendicular to both a and b x c, the latter of which 
is already perpendicular to both b and c. It therefore lies in the plane of b and c. 

Consequently, (a x b) x c, which is the same as — c x (a x b), will he in the plane of a and 
b. 

Hence it will, in general, be different from ax (b x c). 

The formula for a triple vector product 

Suppose that 


a = Gqi + 0-2 j + 03k, b = &ii + + hk and c = cii + C 2 j + 03k. 


Then, 


a x (b x c) = a x 


i 

61 

Cl 


j k 

b 2 h 

c 2 c 3 
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1 


k 


j 

Cl i O 2 

(6 2 c 3 - 6 3 c 2 ) (6 3 ci - 6ic 3 ) 


a 3 

(6ic 2 - 6 2 ci) 


The i component of this vector is equal to 


a 2 (&ic 2 - b 2 ci) - a 3 (6 3 ci - &ic 3 ) = fei(a 2 c 2 + a 3 c 3 ) - ci(a 2 6 2 + a 3 6 3 ); 
but, by adding and subtracting aib\Ci, the right hand side can be rearranged in the form 


(cqci + u 2 c 2 + a 3 c 3 )&i — (ai^i + q 2 6 2 + a 3 & 3 )ci, 
which is the i component of the vector (a • c)b — (a • b)c. 

Similar expressions can be obtained for the j and k components and we may conclude that 

a x (b x c) = (a • c)b — (a • b)c. 

EXAMPLE 

Determine the triple vector product of a with b and c, where 

a = i + 2 j — k, b = — 2i + 3j and c = 3k. 


Solution 


a • c = —3 and a • b = 4. 


Hence, 


a x (b x c) = —3b — 4c = 6i — 9j — 12k. 


5 



8.4.3 EXERCISES 


1. Evaluate the triple scalar product, a • (b x c), in the case when 

a = 2i + j — 3k, b = i — 2 j + k and c = — i + j — 4k. 

2. Determine the volume of the parallelepiped with adjacent edges defined by the vectors 

a=— i+j + 3k, b = 2i — j and c = i+j + k. 

3. Determine the triple vector product of a with b and c in the cases where 

(a) 

a = — i + j + 3k, b = 2i + j and c = i + j + k; 

(b) 

a = 4i — k, b = 3i + 5j — k and c = i — j — k; 

(c) 

a = 4i + 2j + k, b = 5i and c = — j + 3k. 

4. Show that the following three vectors are coplanar: 

a = — i + 3j — 2k, b = 4i — 6j + 2k and c = — 3i + 12j — 9k. 

5. Show that 

[(a + b), (b + c), (c + a)] = 2[a,b,c]. 

6. Show that 

(a x b) x(c x d) = [a, c, d]b — [b, c, d]a = [a, b, d]c — [a, b, c]d. 

7. Show that 


8. Show that 


a x (b x c) + b x (c x a) + c x (a x b) = O. 


(a x b) • (c x d) 


a • c a • d 

b • c b • d 
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8.4.4 ANSWERS TO EXERCISES 


1 . 

20 . 

2 . 

8 . 

3. (a) 

7i + 4j + k; 

(b) 

2i + 38j + 8k; 

(c) 

5i + 20j - 60k. 

4. Show that the triple scalar product is zero. 

5. Remove all brackets and use the fact that a triple scalar product is zero when two of 
the vectors are the same. 

6. Use the triple vector product formula. 

7. Use the triple vector product formula. 

8. Rearrange in the form 

a • [b x (c x d)]. 
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UNIT 8.5 - VECTORS 5 


VECTOR EQUATIONS OF STRAIGHT LINES 
8.5.1 INTRODUCTION 

The concept of vector notation and vector products provides a convenient method of repre- 
senting straight lines and planes in space by simple vector equations. Such vector equations 
may then, if necessary, be converted back to conventional cartesian or parametric equations. 

We shall assume that the position vector of a variable point, P (x,y,z), is given by 


r = a;i + yj + z k, 


and that the position vectors of fixed points, such as Pi(xi, yi, zi) and P2<A2, y-2, z 2 ), are 
given by 


U = xii + yij + zik, V2 = x 2 i + y 2 j + z 2 k, etc. 

8.5.2 THE STRAIGHT LINE PASSING THROUGH A GIVEN POINT AND 
PARALLEL TO A GIVEN VECTOR 

We consider, here, the straight line passing through the point, Pi, with position vector, ri, 
and parallel to the vector, a = cqi + a2j + 03k. 
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From the diagram, 


OP = OPi + PiP. 


But, 


PiP = fa, 

for some number t. 

Hence, 


r = lq + fa, 

which is the vector equation of the straight line. 

The components of a form a set of direction ratios for the straight line. 

Notes: 

(i) The vector equation of a straight line passing through the origin and parallel to a given 
vector a will be of the form 


r = fa. 


(ii) By equating i, j and k components on both sides, the vector equation of the straight line 
passing through Pi and parallel to a leads to parametric equations 


x = x\ + Gqf, y = yi + a2f, z = z\ + a^t; 


and, if these are solved for the parameter, f, we obtain 

x - xi _ y - yi _ z - Z! 

CL i (22 ^3 


which is the standard cartesian form of the straight line. 
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EXAMPLES 


1. Determine the vector equation of the straight line passing through the point with posi- 
tion vector i — 3 j + k and parallel to the vector, 2i + 3j — 4k. 

Express the vector equation of the straight line in standard cartesian form. 

Solution 

The vector equation of the straight line is 

r = i — 3j + k + f(2i + 3 j — 4k) 


or 


x\ + yj + zk = (1 + 2t)i + (—3 + 3f)j + (1 — 4f)k. 

Eliminating t from each component, we obtain the cartesian form of the straight line, 

x — 1 y + 3 z — 1 
2 “ 3 ~~ -4 

2. The equations 

3a: + 1 y — 1 —z + 5 
2 “ 2 “ 3 

determine a straight line. Express them in vector form and obtain a set of direction 
ratios for the straight line. 

Solution 

Rewriting the equations so that the coefficients of x, y and z are unity, we have 

x + | y — 1 z — 5 


Hence, in vector form, the equation of the line is 

r = — ^i+j + 5k + f (Ji + 2j-3k) . 

Thus, a set of direction ratios for the straight line are | : 2 : —3 or 2 : 6 : —9. 
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3. Show that the two straight lines 


where 


r = ri + t/&i and r = T 2 + ta 2 , 


ri=j, ai = i + 2j-k, 


and 


U = i+j + k, a2 = — 2i — 2 j , 

have a common point and determine its co-ordinates. 

Solution 

Any point on the first line is such that 


x = t, y — 1 + 2t, z = —t, 

for some parameter value, f; and any point on the second line is such that 


x = 1 — 2s, y = 1 — 2s, z — 1, 
for some parameter value, s. 

The lines have a common point if it is possible to find values of t and s such these are 
the same point. 

In fact, t = —1 and s — 1 are suitable values and give the common point (—1, —1, 1). 


8.5.3 THE STRAIGHT LINE PASSING THROUGH TWO GIVEN POINTS 

If a straight line passes through the two given points, Pi and P 2 , then it is certainly parallel 
to the vector, 


a = PjP 2 = (x 2 - au)i + ( 2/2 - 2/1 )j + (z 2 - 2 i)k. 
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Thus, the vector equation of the straight line is 

r = ri + ta 

as before. 

Notes: 

(i) The parametric equations of the straight line passing through the points, Pi and P2, are 

x = xi + (x 2 - xi)t, y = yi + (y 2 - yi)t, z = z 1 + (z 2 - zi)t\ 

and we notice that the “base-points” of the parametric representation (that is, Pi and P2) 
have parameter values t — 0 and t — 1 respectively. 

(ii) The standard cartesian form of the straight line passing through Pi and P2 is 

x - X! _ y-yi _ z-Zj 
x 2 - x 1 y 2 - y± z 2 - z 1 ' 
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EXAMPLE 


Determine the vector equation of the straight line passing through the two points, Pi (3, —1, 5) 
and P 2 (-l, -4, 2). 


Solution 


OPi = 3i — j + 5k 


and 


P jP 2 — ( — 1 — 3)i + ( — 4 + l)j + (2 — 5)k — — 4i — 3j — 3k. 


Hence, the vector equation of the straight line is 

r = 3i — j + 5k — f(4i + 3j + 3k). 

8.5.4 THE PERPENDICULAR DISTANCE OF A POINT FROM A STRAIGHT 
LINE 

For a straight line, /, passing through a given point, A, with position vector, a and parallel 
to a given vector, b, it may be necessary to determine the perpendicular distance, d, from 
this line, of a point, C, with position vector, c. 

I 



From the diagram, with Pythagoras’ Theorem, 
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d 2 = (AC) 2 - (AB) 2 . 


But, AC = c — a, so that 

(AC) 2 = (c — a) • (c — a). 

Also, the length, AB, is the projection of AC onto the line, /, which is parallel to b. 
Hence, 


which gives the result 

d 2 = (c — a) • (c — a) — 

From this result, d may be deduced. 

EXAMPLE 

Determine the perpendicular distance of the point (3,— 1,7) from the straight line passing 
through the two points, (2, 2, —1) and (0, 3, 5). 

Solution 

In the standard formula, we have 


(c — a) • b 
b 


a = 2i + 2j - j, 

b = (0 - 2)i + (3 - 2)j + (5 - [-l])k = -2i + j + 6k, 
b = y/(-2) 2 + l 2 + 6 2 = a/41, 
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c = 3i — j + 7k, 


and 


c - a = (3 - 2)i + (-1 - 2)j + (7 - [— l])k = i - 3j + 8k. 


Hence, the perpendicular distance, d, is given by 


d 2 


1 2 + (-3) 2 + 8 2 


(1)(— 2) + (~3)(1) + (8)(6) 
V4l 


74- 


43 

Tii 


which gives d ~ 8.20. 

8.5.5 THE SHORTEST DISTANCE BETWEEN TWO PARALLEL STRAIGHT 
LINES 

The result of the previous section may also be used to determine the shortest distance 
between two parallel straight lines, because this will be the perpendicular distance from one 
of the lines of any point on the other line. 

We may consider the perpendicular distance between 

(a) the straight line passing through the fixed point with position vector i'i and parallel to 
the fixed vector, a 

and 

(b) the straight line passing through the fixed point with position vector r 2 , also parallel to 
the fixed vector, a. 

These will have vector equations, 


r = ri + fa and r = r 2 + ta. 
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In the diagram, F is the foot of the perpendicular onto the second line from the point Pi on 
the first line and the length of this perpendicular is d. 

Hence, 


d 2 = (£2 - £1) • (p2 — £1) 


(£2 ~ £1 ) • a 
a 


EXAMPLE 

Determine the shortest distance between the straight line passing through the point with 
position vector i'i = 4i — j + k, parallel to the vector b = i + j + k, and the straight line 
passing through the point with position vector i - 2 = — 2i + 3j — k, parallel to b. 

Solution 

From the formula, 


d 2 = (— 6i + 4j — 2k) • (— 6i + 4j — 2k) 


(— 6i + 4j - 2k) • (i + j + k) 

V3 


That is, 


d 2 = (36 + 16 + 4) 


-6 + 4-2 

V3 



152 

IT’ 
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which gives 


7.12 

8.5.6 THE SHORTEST DISTANCE BETWEEN TWO SKEW STRAIGHT LINES 

Two straight lines are said to be “skew” if they are not parallel and do not intersect each 
other. It may be shown that such a pair of lines will always have a common perpendicular 
(that is, a straight line segment which meets both, and is perpendicular to both). Its length 
will be the shortest distance between the two skew lines. 

For the straight lines, whose vector equations are 


r = ri + tag and r = rg + fag, 

suppose that the point, M t , on the first line and the point, M 2 , on the second line are the 
ends of the common perpendicular and have position vectors, mi and rn 2 , respectively. 

Then, 

mi = ri + f iai and nig = rg + f 2 ag, 
for some values, t\ and t 2 , of the parameter, t. 



Firstly, we have 
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MjM 2 = m 2 - mi = (rg - dJ i t - 2 ag - hai. 


Secondly, a vector which is certainly perpendicular to both of the skew lines is ai x a 2 , so 
that a unit vector perpendicular to both of the skew lines is 


3\ X cl 2 
ell X 8>2 


This implies that 


(l2 - D_) + t 2 &2 - t iai 


id 


db\ X c&2 
ctl X 3,2 


5 


where d is the shortest distance between the skew lines. 

Finally, if we take the scalar (dot) product of both sides of this result with the vector ai x a^, 
we obtain 


giving 


(l2 -£i) 


(ai x &£) 


id 


ai x a^l 
|ai x a 2 



(gg ~ pj * (ai x ag) 
|ai x a 2 | 


EXAMPLE 

Determine the perpendicular distance between the two skew lines 

r = t|i fai and r = r^ i t& 2 , 


where 


ri = 9 j i 2k, ai = 3i — j + k, 
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i2 = — 6i — 5 j + 10k, a2 = — 3i + 2 j + 4k. 


Solution 


r 2 — rx = — 6i — 14j + 8k 


and 

i j k 

ai xa2 = 3 —1 1= — 6i — 15j + 3k, 

_ -3 2 4 

so that 


C 6) ( 6) + (— 14)(— 15) + (8)(3) 
v / 3tr+~225~+~9 


= V270 = 3^30. 


8.5.7 EXERCISES 


1. Determine the vector equation, and hence the parametric equations, of the straight line 
which passes through the point, (5, —2, 1), and is parallel to the vector i — 3 j + 2k. 

2. The equations 

—x + 2 ?yy — 1 2^ + 1 

7 “ 5 “ 3 

determine a straight line. Determine the equation of the line in vector form, and state 
a set of direction ratios for this line. 

3. Show that there is a point common to the two straight lines 

r = ri + t % and r = ijj + t& 2 , 

where 

ri = 3j + 2k, r2 = -2i + 2 j + 3k, 
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and 


ai = 2i — 4j — 2k, a2 = 9i — 3j — 6k 

Determine the co-ordinates of the common point. 

4. Determine, in standard cartesian form, the equation of the straight line passing through 
the two points, (—2,4,9) and (2,— 1,6). 

5. Determine the perpendicular distance of the point (0, —2, 5) from the straight line which 
passes through the point (1, —1, 3) and is parallel to the vector 3i + j + k. 

6. Determine the shortest distance between the two parallel straight lines 

r = ri + fa and r = £2 + fa, 

where 

£1 = -3i + 2j + k, £2 = 2i - j - k 

and 

a = i + 5j + k. 

7. Determine the shortest distance between the two skew straight lines 

r = £1 + f aj_ and r = £2 + t& 2 , 

where 

£1 = i - 2j + k, £2 = -i + 3j + 2k, 

and 

ai = i— j — k, a 2 = 5i + j — 2k 

8.5.8 ANSWERS TO EXERCISES 

1. 

r = (5i - 2j + k + t(i - 3j + k), 

giving 

x = 5 + t, = — 2 — 37, z — 1 + t. 
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2. In vector form, the equation of the line is 


_ . -L. -L, / , 

r = 2i + l k + t ( — 7i + j k 

— o \ o o 


and set of direction ratios is 


-42 : 10 : 9 


3. The common point has co-ordinates (1, 1, 1). 

4. 

x + 2 y — 4 z — 9 
4 “ -5 “ -3 


5. 


6 . 


7. 




~ 2.04 
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UNIT 8.6 - VECTORS 6 


VECTOR EQUATIONS OF PLANES 

8.6.1 THE PLANE PASSING THROUGH A GIVEN POINT AND 
PERPENDICULAR TO A GIVEN VECTOR 

A plane in space is completely specified if we know one point in it, together with a vector 
which is perpendicular to the plane. 


n 



In the diagram, the given point is Pi, with position vector, i'i, and the given vector is n. 
Hence, the vector, PiP, is perpendicular to n, which leads to the equation 


or 


(r — ri) • n = 0 


r • n = ri • n = d say. 


Notes: 

(i) In the particular case when n is a unit vector, the constant, d, represents the perpendicular 
projection of ri onto n, which is therefore the perpendicular distance of the origin from the 
plane. 

(ii) If r = x\ + yj + zk and n = ai + fej + ck, then the cartesian form for the equation of the 
above plane will be 
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ax + by + cz = d. 


That is. it is simply a linear equation in the variables x, y and z. 

EXAMPLE 

Determine the vector equation and, hence, the cartesian equation of the plane passing 
through the point with position vector 3i — 2j + k and perpendicular to the vector i — 4j — k. 

Solution 

The vector equation is 


r . (i - 4j - k) = (3) (1) + ( — 2)( — 4) + (1)(-1) = 10 


and, hence, the cartesian equation is 


x — Ay — z = 10. 

8.6.2 THE PLANE PASSING THROUGH THREE GIVEN POINTS 

We consider a plane passing through the points, Pi(aq, yi, Zi), P2<A2, 2/2, ^2) and Ps(iC3, 2/3, ^3). 


n 



In the diagram, a suitable vector for n is 
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k 


P 1 P 2 X P 1 P 3 


i j 

x 2 -Xi y 2 - yi Z 2 - z 1 
x 3 ~ xi y 3 - yi z 3 - z 1 


and, hence, the equation, 


(r — ri) • n = 0 , 


of the plane becomes 


X — X\ 
x 2 - X\ 
x 3 - Xi 


y-y 1 

2/2 - yi 

y 3 - yi 


Z — Z 1 

£2 - Zi 
Z 3 - Zi 


0 . 


But, from the properties of determinants, this is equivalent to 


X 

y 

z 

X\ 

y\ 

Z 1 

X 2 

2/2 

Z 2 

X 3 

ys 

Z 3 


0, 


which is the standard equation of the plane through the three given points. 

EXAMPLE 

Determine the cartesian equation of the plane passing through the three points, (0,2, — 1), 
(3,0,1) and (-3, -2,0). 

Solution 

The equation of the plane is 


x y z 1 

0 2-11 
3 0 11 

-3-2 0 1 


0, 


which, on expansion and simplification, gives 
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2x — 3y — 6z = 0, 


showing that the plane also passes through the origin. 

8.6.3 THE POINT OF INTERSECTION OF A STRAIGHT LINE AND A 
PLANE 

First, we recall (from Unit 8.5) that the vector equation of a straight line passing through 
the fixed point, with position vector ri, and parallel to the fixed vector a, is 

r = iq + t a. 


For the point of intersection of this line with the plane, whose vector equation is 


r • n = d, 


the value of t must be such that 


(ri + ta) • n = d, 

which is an equation from which the appropriate value of t and, hence, the point of intersec- 
tion may be found. 

EXAMPLE 

Determine the point of intersection of the plane, whose vector equation is 

r • (i — 3j — k) = 7, 

and the straight line passing through the point, (4, —1,3), which is parallel to the vector 

2i — 2j + 5k. 

Solution 

We need to obtain the value of the parameter, t, such that 

(4i — j + 3k + t[2i — 2 j + 5k]) • (i — 3j — k) = 7. 
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That is, 


(4 + 2t)(l) + (-1 - 2t)(—3) + (3 + 5t)(— 1) = 7 or 4 + 3t = 7, 

which gives t — 1; and, hence, the point of intersection is (4 + 2, —1 — 2, 3 + 5) = (6, —3, 8). 

8.6.4 THE LINE OF INTERSECTION OF TWO PLANES 

Suppose we are given two non-parallel planes whose vector equations are 

r • rq = d\ and r • i q = d 2 - 

Their line of intersection will be perpendicular to both iq and iq, since these are the normals 
to the two planes. 

The line of intersection will thus be parallel to iq x iq, and all it remains to do, to obtain 
the vector equation of this line, is to determine any point on it. 

For convenience, we may take the point (common to both planes) for which one of x, y or z 
is zero. 

EXAMPLE 

Determine the vector equation, and hence the cartesian equations (in standard form), of the 
line of intersection of the planes whose vector equations are 


r • ni = 2 and r • iq = 17, 


where 


iq = i + j + k and rq = 4i + j + 2k. 

Solution 

Firstly, 

i j k 

iq x iq = 1 1 1 = i + 2j — 3k. 

~ — 4i2 
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Secondly, the cartesian equations of the two planes are 


x + y + z = 2 and Ax + y + 2z = 17; 


and, when z — 0, these become 


x + y — 2 and 4a; + y = 17, 

which, as simultaneous linear equations, have a common solution of x — 

Thirdly, therefore, a point on the line of intersection is (5, —3,0), which 
5i - 3j. 

Hence, the vector equation of the line of intersection is 


5, 2/ = —3. 

has position vector 


r = 5i — 3j + t(i + 2j — 3k). 

Finally, since x = 5 + 1, y = —3 + 2 1 and z = —3 1 the line of intersection is represented, in 
standard cartesian form, by 


x — 5 y + 3 z 


8.6.5 THE PERPENDICULAR DISTANCE OF A POINT FROM A PLANE 

Given the plane whose vector equation is r • n — d and the point, Pi, whose position vector 
is i'i, the straight line through the point, Pi, which is perpendicular to the plane has vector 
equation 


r = i’i + t n. 


This line meets the plane at the point, Pq, with position vector iq + ton, where 


(ri + t 0 n) • n — d. 


That is, 
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(ri • n) + t 0 n 2 = d. 


Hence, 


to = 


d — (r^ • n) 




Finally, the vector P 0 Pi = (iy + t 0 n) — iq = t 0 ii 

and its magnitude, to 11 , will be the perpendicular distance, p, of the point Pi from the plane. 
In other words, 


d — (ii • n) 

P = = 

n 


Note: 

In terms of cartesian co-ordinates, this formula is equivalent to 

d — (ax i + by i + cz\) 

^ y/a 2 + b 2 + c 2 

where a, b and c are the i, j and k components of n respectively. 

EXAMPLE 

Determine the perpendicular distance, p, of the point (2, —3, 4) from the plane whose carte- 
sian equation is x + 2y + 2z = 13. 

Solution 

From the cartesian formula 


13 ~ [(f)(2) + (2)(— 3) + (2) (4)] 9 

Vl 2 + 2 2 + 2 2 3 
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8.6.6 EXERCISES 


1. Determine the vector equation and hence the cartesian equation of the plane, passing 
through the point with position vector i + 5j — k, and perpendicular to the vector 

2i + j — 3k. 

2. Determine the cartesian equation of the plane passing through the three points (1, —1, 2), 
(3, -2,-1) and (-1,4,0). 

3. Determine the point of intersection of the plane, whose vector equation is 

r • (5i — 2j — k) = -3, 

and the straight line passing through the point (2, 1, —3), which is parallel to the vector 

i+j - 4k. 

4. Determine the vector equation, and hence the cartesian equations (in standard form), 
of the line of intersection of the planes, whose vector equations are 

r • ii! = 14 and r • n 2 = —1, 


where 


ni = — 4i + 2j — k and 112 = 2i + j + 3k. 

5. Determine, in surd form, the perpendicular distance of the point (—5, —2, 8) from the 
plane whose cartesian equation is 2x — y + 3z = 17. 


8.6.7 ANSWERS TO EXERCISES 


1 . 


2 . 


r • (2i + j — 3k) = 10, or 2x + y — 3z = 10. 


3 . 


17x + 10 y + 8z = 23. 


(0,-1, 5). 



4 . 


r = — 2i + 3j + t( 7i + lOj — 8k) 


or 


5. 


x + 2 y — 3 
7 “ 10 


z 


-8 


(= t). 


1 

7S' 
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UNIT 9.1 - MATRICES 1 

DEFINITIONS AND ELEMENTARY MATRIX ALGEBRA 
9.1.1 INTRODUCTION 
(a) Presentation of Data 

Sets of numerical information can often be presented as a rectangular “array” of numbers. 
For example, a football results table might look like this: 


TEAM 

PLAYED 

WON 

DRAWN 

LOST 

Blackburn 

22 

11 

6 

5 

Burnley 

22 

9 

6 

7 

Chelsea 

21 

7 

8 

6 

Leicester 

21 

6 

8 

7 

Stoke 

21 

6 

6 

9 


If the headings are taken for granted, we write simply 

'22 11 6 5' 

22 9 6 7 

21 7 8 6 

21 6 8 7 

.21 6 6 9. 

and this symbol is called a “matrix” . 

Note: 

The mould in which printers once cast type was called a matrix. In mathematics, the word 
“matrix” signifies that we have spaces into which numbers can be placed. 

(b) Presentation of Algebraic Results 

In two-dimensional geometry, the “vector”, OPo, joining the origin to the point Po(^o,2/o) 
can be moved to the position OP , joining the origin to the point Pi (£ 1 , 2 / 1 ) by means of 
a “reflection”, a “rotation”, a “magnification” or a combination of such operations. 
It can be shown that the relationship between the co-ordinates of Po and Pi in any such 
operation is given by 


xi = ax 0 + by 0 , 
Vi = cx o + dy 0 . 
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ILLUSTRATIONS 


1. The equations 

Xi = -x 0 , 

yi = yo 

represent a reflection in the y- axis. 

2. The equations 

X\ = kx o, 

yi = ky 0 

represent a magnification when | k |> 1 and a contraction when | k |< 1. 

3. The equations 

X\ = xq cos 9 — y 0 sin 9 , 
y\ = xo sin 9 + yo cos 9 

represent a rotation of OP 0 through an angle 9 in a counter-clockwise direction. 


Snch an operation is called a “linear transformation” and is completely specified by the 
coefficients a, b, c, d in the correct positions. When referring to a linear transformation, it is 
therefore more convenient to write the matrix symbol 

a b' 
c d _ 

Sometimes, we want to show that a transformation 

' a b ' 
c d 


operates on the vector OP 0 , transforming it into the vector OP 1 . We write 


'xi 


'a 

b~ 


~Xq 

- l)\ - 


_ c 

d_ 


.yo. 


The two new symbols are still called matrices but can be given a special name as we see 
later. 
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9.1.2 DEFINITIONS 


(i) A matrix is a rectangular array of numbers arranged in rows (horizontally), columns 
(vertically) and enclosed in brackets. 


ILLUSTRATIONS: 


'1 

3 

7 ' 


.9 

9 

10. 

1 


10 16 

3 5 

4 0 


17 

11 

10 


! 5 7]. 


Note: 

The rows are counted from the top to the bottom of the matrix and the columns are counted 
from the left to the right of the matrix. 

(ii) Any number within the array is called an “element” of the matrix. The term ij — th element 
refers to the element lying in the i-th row and the j-th column of the matrix. 

(iii) If a matrix has m rows and n columns, it is called a “matrix of order m x n” or 
simply an “m x n matrix” . It clearly has mn elements. 

(iv) A matrix of order m x m is called a “square matrix” . 

Note: 

A matrix of order 1 x 1 is considered to be the same as a single number. This assumption 
has particular significance in the definition of a matrix product (see later). 

(v) A matrix of order m x 1 is called a “column vector” and a matrix of order 1 x n is 
called a “row vector”. 

Note: 

Matrices of order 2x1, 1x2, 3x1 and 1x3 have easy physical interpretations as vectors. 
But, in abstract linear algebra, there are vectors of any dimension; so the names “column 
vector” and “row vector” are retained no matter how many elements there are. 

(vi) An arbitrary matrix whose elements and order do not have to be specified may be 
denoted by a single capital letter such as A,B,C, etc. 

An arbitrary matrix of order m x n may be denoted fully by the symbol 


d n 

G 12 

®13 

• • &1 n 

d 21 

«22 

«23 

■ • ^2 n 

®m 1 

dm2 

dm3 • • 

■ • ^ mn 
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where, in each double-subscript, the first number is the row number and the second number 
is the column number. In a situation where the matrix has already been denoted by a single 
capital letter, the subsequent use of the full notation should include the corresponding small 
letter on which to attach the double-subscripts. 

If the matrix is square, the elements an, a 2 2 , ^33, ...a mm are called the “diagonal elements” 
and their sum is called the “trace” of the matrix. This line of elements is called the “leading 
diagonal” of the matrix. 

An abbreviated form of the full notation is [aid „ 

l vJmxn 

EXAMPLE 

A matrix A = [a.q] 2x 3 is such that a tJ = i 2 + 2 j. Write out A in full. 

Solution 

This is an artificially contrived example, but will serve to illustrate the use of the double- 
subscript notation. We obtain 


A 


'3 5 7 ' 

6 8 10 


(vii) Given a matrix, A, of order m x n, the matrix of order n x m obtained from A by 
writing the rows as columns is called the “transpose” of A and is denoted by A T - some 
books use A' or A. 


ILLUSTRATION 


If 


then, 


A 


a\ b\ 
a 2 b 2 , 
a 3 b 3 


A t 


a 1 a 2 a 3 
. bi b 2 b 3 


(viii) A matrix, A, is said to be “symmetric” if A = A T . 

ILLUSTRATION 


A 


a h q 

h b f 

9 f c 


is symmetric. 
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(ix) A matrix, A, is said to be “skew-symmetric” if the elements of A T are minus the 
corresponding elements of A itself. This will mean that the leading diagonal elements must 
be zero. 

ILLUSTRATION 


A 



a — b 
0 c 
-c 0 


is skew — symmetric. 


(x) A matrix is said to be “diagonal” if the elements which are not on the leading diagonal 
have value zero while the elements on the leading diagonal are not all equal to zero. 

9.1.3 THE ALGEBRA OF MATRICES (Part One) 

An “Algebra” (coming from the Arabic word AL-JABR) refers to any mathematical system 
which uses the concepts of equality, addition, subtraction, multiplication and division. For 
example, the algebra of numbers is what we normally call “arithmetic”; but algebraical 
concepts can be applied to other mathematical systems of which matrices is one. 

In meeting a new mathematical system for the first time, the concepts of equality, addition, 
subtraction, multiplication and division need to be properly defined, and that is the purpose 
of the present section. In some cases, the definitions are fairly obvious, but they need to 
be made without contradicting ideas already established in the system of numbers which 
matrices depend on. 

1. Equality 

Unlike a determinant, a matrix does not have a numerical value; so the use of equality here 
is not that which is made in elementary arithmetic. Rather we use it to mean “is the same 
as”. 

Two matrices are said to be equal if they have the same order and also pairs of elements 
in corresponding positions are equal in value. 


In symbols, we could say that 


provided 



mxn 


[K\ 


mxn 


Q>ij h 


v 
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For example, 


x 

y 

z 

implies that x — 1, y — 2 and z — 3. 

We shall meet this type of conclusion later in a discussion on the solution, by matrices, of a 
set of simultaneous equations (see Unit 9.3). 

2. Addition and Subtraction 

The scientific applications of matrices require us to define the sum and difference of two 
matrices only when they have the same order. 

The sum of two matrices of order m x n is formed by adding together the pairs of elements 
in corresponding positions. Similarly, the difference of two matrices of order m x n is formed 
by subtracting the pairs of elements in corresponding positions. 

In symbols, we could say that 

[®®ilinxn [^'ilraxn ' 

where 

Cij &ij i bjj . 



Note: 

It may occur, in a calculation, that a matrix has to be subtracted from itself, in which case 
we would obtain another matrix of the same order but whose elements are all zero. This 
type of matrix is called a“null matrix” and may be denoted, for short, by [0] mX n or just 
[0] when the order is understood. 

ILLUSTRATION 

A grocer has two shops and, in each shop, he sells apples, oranges and bananas. The sales, 
in kilogrammes, of each fruit for the two shops on two separate days are represented by the 
matrices 


'36 25 10 
.20 30 15 

where the rows refer to the two shops and the columns refer to apples, oranges and bananas 
respectively. 

The total sales, in kilogrammes, of each fruit for the two shops on both days together are 
represented by the matrix 


40 30 12 
22 35 20 
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' 76 55 22 ' 

.42 65 35. ' 

The differences in sales of each fruit for the two shops between the second day and the first 
day are represented by the matrix 


'4 5 2' 

.2 5 5. 

3. Additive Identities and Additive Inverses 

For the sake of completeness, we mention here that 

(a) A null matrix, when added to any other matrix of the same order, leaves that other 
matrix identically the same as it was to start with. For this reason, a null matrix behaves 

as an “additive identity” . 

(b) If any matrix A is added to the matrix —A (that is, the matrix obtained from A by 
reversing the signs of all the elements) the result obtained is the corresponding null matrix. 
For this reason —A can be called the “additive inverse” of A. 

9.1.4 EXERCISES 


1. State the order of each of the following matrices: 


a 


3 0 
2 4 


;(b) 


(c) [3 5 4 . 


Two of the following matrices are equal to each other. Which are they ? 

'sin 60° tan 30° 1 , . cos 60° ^ 

. tan 45° sin 90° J ’ ^ sin 90° cos 90 c 


r i 

2 

1 


f 


;(b) 


3. Determine the values of x, y and z given that 


4. If A 


'1 

4 


-3 2 
0 1 




1 \x + y 

y + z 


'5 

r 


yx + z 3x — 2y \ 


.6 

0. 


4 3 2' 

and C 


-1 

4 

L -2' 

-10 2 

— 

0 

f 

0 


determine the elements of the following matrices: 


(a) A + B; (b) (A + B) + C; (c) A + (B + C); 
(d) A - B - C; (e) A T + B T ; (f) (A + B) T . 
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5. Determine which pairs of the following matrices can be added and, for those which can, 
state the sum: 



r 1 1 


'4' 

A = 

1 

2 

,B = 

2 




1 


E = 


, C = [4 2 1], D = 


0 0 
0 0 


"2" 


'2 3' 

, G = 

i 


i 

0 

1 


,F = 

.5 0. 

0 

j H = 

o 

i 



i 

i 


1 

o 
1 


, J = [3 3 3 . 


9.1.5 ANSWERS TO EXERCISES 


1. (a) 2 x 3; (b) 2 x 1; (c) 1 x 3. 

2. (a) and (c) are equal to each other. 

3. x = 2, y = 3 and z — 4. 


4. (a) 


5 0 4 
3 0 3 


(b) 


4 4 
3 1 


; (c) 


(d) 

'-2 -10 2 ' 
5 -1 -1. 

;(e) 

'5 3' 
0 0 

;(f) 

'5 3' 
0 0 




4 3 


4 3 


4 

i 



Q 1 


'5' 

5. A + E = E + A = 

o 

3. 

,B+G=G+B= 

2 



■ 

2 

C + J = J + C=[7 1 

5 4], D + F = F + D = 


'l' 


'4 

G+H=H+G= 

0 

,B+H=H+B= 

2 


1 


1 


That is, twelve possible additions but only six distinct results. 



“JUST THE MATHS” 


UNIT NUMBER 
9.2 

MATRICES 2 
(Further matrix algebra) 

by 

A. J. Hobson 


9.2.1 Multiplication by a single number 

9.2.2 The product of two matrices 

9.2.3 The non-commutativity of matrix products 

9.2.4 Multiplicative identity matrices 

9.2.5 Exercises 

9.2.6 Answers to exercises 




UNIT 9.2 - MATRICES 2 - THE ALGEBRA OF MATRICES (Part Two) 
9.2.1 MULTIPLICATION BY A SINGLE NUMBER 


If we were required to multiply a matrix of any order by a positive whole number, n, 
we would clearly regard the operation as equivalent to adding together n copies of the given 
matrix. Thus, in the result, every element of this given matrix would be multiplied by n; 
but it is logical to extend the idea to the multiplication of a matrix by any number, A, not 
necessarily a positive whole number, the rule being to multiply every element of the matrix 
by A. 

In symbols we could say that 
where 




bij \dij . 


Note: 

The rule for multiplying a matrix by a single number can also be used in reverse to remove 
common factors from the elements of a matrix as illustrated as follows: 


ILLUSTRATION 


' 5 10' 

15 20 



2 ' 
4. ' 


9.2.2 THE PRODUCT OF TWO MATRICES 

The definition of a matrix product is more difficult to justify than the previous concepts, 
partly because it is by no means an obvious definition and partly because we cannot be sure 
exactly what originally led to the making of the definition. 

Some hint is given by the matrix equation at the end of the introduction to Unit 9.1, where 
the product of a 2 x 2 matrix and a 2 x 1 matrix is another 2x1 matrix; but we must be 
prepared to meet other orders of matrix as well. 

We shall introduce the definition with a semi-practical illustration, then make a formal 
statement of the definition itself. 

ILLUSTRATION 

A motor manufacturer, with three separate factories, makes two types of car, one called 
“standard” and the other called “luxury”. 

In order to manufacture each type of car, he needs a certain number of units of material and 
a certain number of units of labour each unit representing £300. 
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A table of data to represent this information could be 


Type 

Materials 

Labour 

Standard 

12 

15 

Luxury 

16 

20 


The manufacturer receives an order from another country to supply 400 standard cars and 
900 luxury cars; but he distributes the export order amongst his three factories as follows: 


Location 

Standard 

Luxury 

Factory A 

100 

400 

Factory B 

200 

200 

Factory C 

100 

300 


The number of units of material and the number of units of labour needed by each factory 
to complete the order may be given by another table, namely 


Location 

Materials 

Labour 

Factory A 

100 x 12 + 400 x 16 

100 x 15 + 400 x 20 

Factory B 

200 x 12 + 200 x 16 

200 x 15 + 200 x 20 

Factory C 

100 x 12 + 300 x 16 

100 x 15 + 300 x 20 


If we now replace each table by the corresponding matrix, the calculations appear as the 
product of a 3 x 2 matrix and a 2 x 2 matrix. That is, 


100 

400 

200 

200 

100 

300 


'12 15' 
16 20 


100 x 12 + 400 x 16 
200 x 12 + 200 x 16 
100 x 12 + 300 x 16 


100 x 15 + 400 x 20 
200 x 15 + 200 x 20 
100 x 15 + 300 x 20 


7600 9500 
5600 7000 
6000 7500 


3x2 2x2 3x2 3x2 

OBSERVATIONS 

(i) The product matrix has 3 rows because the first matrix on the left has 3 rows. 


(ii) The product matrix has 2 columns because the second matrix on the left has 2 columns. 

(iii) The product cannot be worked out unless the number of columns in the first matrix 
matches the number of rows in the second matrix with no elements left over in the pairing-up 
process. 


(iv) The elements of the product matrix are systematically obtained by multiplying (in pairs) 
the corresponding elements of each row in the first matrix with each column in the second 
matrix. To pair up the correct elements, we read each row of the first matrix from left to 
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right and each column of the second matrix from top to bottom. 

The Formal Definition of a Matrix Product 

If A and B are matrices, then the product AB is defined (that is, it has a meaning) only 
when the number of columns in A is equal to the number of rows in B. 

If A is of order m x n and B is of order n x p, then AB is of order m x p. 

To obtain the element in the i-th row and j-th column of AB, we multiply corresponding 
elements of the i-th row of A and the j-th column of B, then add up the results. 

ILLUSTRATION 

t 2 r 

.0 -1 4. • 

Note: 

Confusion could arise when multiplying a matrix of order 1 x 1 by another matrix. Appar- 
ently, the other matrix would need to have either a single row or a single column depending 
on the order of multiplication. 

However, as stated in Unit 9.1, a matrix of order 1 x 1 is considered to be a special case, 
and is defined separately to be the same as a single number. Hence a matrix of any order 
can be multiplied by a matrix of order lxl even though this does not fit the formal rules 
for matrix multiplication in general. 

9.2.3 THE NON-COMMUTATIVITY OF MATRIX PRODUCTS 

In elementary arithmetic, if a and b are two numbers, then ab = ba (that is, the product 
“commutes”). But this is not so for matrices A and B as we now show: 

(a) If A is of order m x n, then B must be of order n x m if both AB and BA are to be 
defined. 

(b) AB and BA will have different orders unless m = n. in which case the two products will 
be square matrices of order m x m. 

(c) Even if A and B are both square matrices of order m x m, it will not normally be the 
case that AB is the same as BA. A simple numerical example will illustrate this fact: 


10 16 
3 2-11 

1-204 


2 -1 12 
-10 1 15 


'2 

1' 


O 

t-H 

1 


' 0 

5 ' 

• Knl 

o 

t-H 

1 


'2 

1' 


1 

to 

1 

.3 

7. 


. 2 5. 


.11 

35. 

, DUX 

. 2 5. 


.3 

7. 


.19 37 J 
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Notes: 

(i) If we simply wanted to show that AB ^ BA, we would need only to demonstrate that 
one pair of corresponding elements were unequal in value. 

(ii) If such a basic rule of elementary arithmetic is false for matrices, we should, strictly 
speaking, be prepared to justify other basic rules of arithmetic. But it turns out that the 
non-commutativity of matrix products is the only one which causes problems. 

For instance, it can be shown that, provided the matrices involved are compatible for addition 
or multiplication, 

A + B = B + A; the “Commutative Law of Addition” . 

A + (B + C) = (A + B) + C; the “Associative Law of Addition” . 

A(BC) = (AB)C; the “Associative Law of Multiplication”. 

A(B + C) = AB + BC or (A + B)C = AC + BC; the “Distributive Laws”. 

(iii) In the matrix product, AB, we say either that B is “pre-multiplied” by A or that A 

is “post-multiplied” by B. 

9.2.4 MULTIPLICATIVE IDENTITY MATRICES 

In connection with matrix multiplication and, subsequently, the solution of simultaneous 
linear equations, an important type of matrix is a square matrix with the number 1 in each 
position of the leading diagonal, but zeros everywhere else. Such a matrix is denoted by I„ 
if there are n rows and (of course) n columns. If it is not necessary to specify the number of 
rows and columns, the notation I, without a subscript, is sufficient. 

ILLUSTRATIONS 






o 

o 

o 

'1 O' 

1— 1 
CO 

III 

10 0 

III 
1 — 1 

£ 

c3 

0 10 0 

.0 1. 

0 10 
0 0 1 

0 0 10 




1 

G 

c 

c 


Any matrix of the type I n multiplies another matrix (with an appropriate number of rows 
or columns) to leave it identically the same as it was to start with. For this reason, I n is 
called a “multiplicative identity matrix”, although we normally call it just an “identity 
matrix” (unless it becomes necessary to distinguish it from the additive identity matrix 
referred to earlier). Another common name for it is a “unit matrix”. 
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For example, suppose that 


A 


a i b\ 

b>2 

a 3 b 3 


Then, post- multiplying by I 2 , it is easily checked that 

AI 2 = A. 


Similarly, pre-multiplying by I 3 , it is easily checked that 

I 3 A = A. 


In general, if A is of order m x n, then 

AI n = I m A = A. 


9.2.5 EXERCISES 


-1 4 -2' 

0 1 0 J ’ 

determine the elements of the following matrices: 

(a) A + 2B; (b) A + 2B - 3C; (c) 3A T - B T . 


1. If A = 


1 

4 


-3 2 
0 1 


B = 


4 3 2 

-10 2 


and C = 


2. Remove a common factor from each of the following matrices in order to express it as 
the product of a number and a matrix: 


' 8 

-r 

;(b) 

' —x 3 4 

—x 2 

-32 

16 . 

x 2 

—4x 2 _ 


3. State the order of the product matrix in each of the following cases: 

(a) A 1 x 2 .B 2 x 2 ; (b) A 3 x 1 .B 1x2 ; (c) A4 X 3.B 3x5 . 

4. For the matrices A 2 X 2 , B 2 X 3 , C 3x3 and D 2X 4 , which of the following products are defined, 
(a) A.B; (b) B.C; (c) C.D; (d) A.C; (e) A.D. 
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5. Determine the elements of the product matrix in each of the following: 
f, [1 2] [2 1] , . |1 [12 241 ,J0 4 -31 2 6 , 

^[3 4 J ' [4 1 J ' [24 36j’( Hi -1 0 J ‘ Hi 7 ’ 


-1 7 


1 2 
(d) 3 0 

1 4 


7 -3' 

-1 4 _ 

'i 0 r 
0 1 0 


; (e) [4 2 1]. 3 ; (f) 3 .[4 2 1]; 


(g) [-1 2 ] • [ n ? nl;(h) 4 6 7. -0.5 -3 2.5 . 

L J [5 8 9j [ 1 1-1 

6. If A = ^ ~^ 9 6 , verify that A.A T = I 2 . 


cos 9 — sin 9 ' 

sin 9 cos 9 


7. If A = ~ g and B = 2 b , verify that (A.B) T = B T .A T not A T .B T . 


9.2.6 ANSWERS TO EXERCISES 


—1 1 3 

! [9 3 61 /I \ [12 -9 121. 10 n 

1* (a) 2 0 5’ ^ ) 2 3 5 ’ ^ ^ ^ 


2. (a) 4. H 4 X ; (b) -x 2 . 4 • 

3. (a) 1 x 2; (b) 3 x 2; (c) 4x5. 

4. (a), (b) and (e) are defined. 

_ , v [10 7 1 ,, , [14 241 , . [ 23 -33 

5 ‘ ^ 22 15 ’ ^ 27 42 ’ ^ -3 9 



'5 5 ' 


1 

1 

to 

1 

i — 1 

1 

(d) 

21 -9 

3 13 

; (e) [4] defined as 4; (f) 

12 6 3 

8 4 2 


'1 0 O' 

(g) [-1 2 -i];(h) 0 10. 

0 0 1 
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UNIT 9.3 - MATRICES 3 


MATRIX INVERSION AND SIMULTANEOUS LINEAR EQUATIONS 
9.3.1 INTRODUCTION 

In Matrix Algebra, there is no such thing as division in the usual sense of this word since we 
would effectively be dividing by a table of numbers with the headings of the table removed. 

However, we may discuss an equivalent operation called inversion which is roughly sim- 
ilar to the process in elementary arithmetic where division by a value, a, is the same as 
multiplication by -. 

For example, consider the solution of a single linear equation in one variable, x, namely 

rnx = k, 

for which the solution is obviously 

k 

x = — . 
m 

An apparently over-detailed solution may be set out as follows: 

(a) Pre- multiply both sides of the given equation by m -1 , giving 

m~ 1 .(mx ) = m~ l k. 

(b) Use the Associative Law of Multiplication in elementary arithmetic to rearrange this as 

(m~ 1 .m)x = m~ 1 k. 

(c) Use the property that m~ l .m = 1 to conclude that 

1.x = m~ l k. 

(d) Use the fact that the number 1 is the multiplicative identity in elementary arithmetic to 
conclude that 

x = 

We shall see later how an almost identical sequence of steps, with matrices instead of num- 
bers, can be used to solve a set of simultaneous linear equations with what is called the 
“multiplicative inverse” of a matrix. 

Matrix inversion is a concept which is developed from the rules for matrix multiplication. 
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9.3.2 MATRIX REPRESENTATION OF SIMULTANEOUS 
LINEAR EQUATIONS 


In this section, we consider the matrix equivalent of three simultaneous linear equations in 
three unknowns; this case is neither too trivial, nor too difficult to follow. Let the equations 
have the form: 


a\x + b\y + c\z = ki, 
a 2 x + b 2 y + c 2 z = k 2 , 
a 3 x + b 3 y + c 3 z = k 3 . 


Then, from the properties of matrix equality and matrix multiplication, these can be written 
as one matrix equation, namely 


1 

o 

1 


X 


k\ 

a 2 b 2 c 2 


y 

— 

k 2 

_ d 3 b 3 c 3 _ 


z 


_k 3 


which will be written for short in the form 

MX = K. 


Note: 

If we could find a matrix, say N, such that NM = I, we could pre-multiply the above statement 
by it to give 

N(MX) = NK. 


That is, 


(NM)X = NK. 


In other words, 

IX = NK. 


Hence, 


X = NK. 


Clearly N exhibits a similar behaviour to the number rrT 1 encountered earlier and, for this 
reason, a better notation for N would be M -1 . This is what we shall use in the work which 
follows. 


2 



9.3.3 THE DEFINITION OF A MULTIPLICATIVE INVERSE 

The “multiplicative inverse” of a square matrix, M, is another matrix, denoted by M -1 , 
which has the property 

M -1 .M = I. 


Notes: 

(i) It is certainly possible for the product of two matrices to be an identity matrix, as 
illustrated, for example, in the exercises on matrix products in Unit 9.2. 

(ii) It is usually acceptable to call M -1 the “inverse” of M rather than the “multiplicative 
inverse” , unless we wish to distinguish this kind of inverse from the additive inverse defined 
in Unit 9.1. 

(iii) It can be shown that, when M -1 .M = I, it is also true that 

M.M -1 = I, 

even though matrices do not normally commute in multiplication. 

(iv) It is easily shown that a square matrix cannot have more than one inverse; for, suppose 
a matrix A had two inverses, B and C. 

Then, 

C = Cl = C(AB) = (CA)B = IB = B, 
which means that C and B are the same matrix. 

9.3.4 THE FORMULA FOR A MULTIPLICATIVE INVERSE 

So far, we have established the fact that a set of simultaneous linear equations is expressible 
in the form 

MX = K 

and their solution is expressible in the form 

X = M -1 K. 


We therefore need to establish a method for determining the inverse, M L , in order to solve 
the equations for the values of the variables involved. 

Development of this method will be dependent on the result known as “Cramer’s Rule” for 
solving simultaneous equations by determinants, (see Units 7.2 and 7.3) 
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(a) The inverse of a 2 x 2 matrix 

Taking 


and 


we require that 


CL2 


M 

M" 1 




CL\ 

bi 1 

■ 

CL 2 

b-2. 


p 

Q] 


_R 

s . 

p 

Q] 

r 

_R 

s_ 



O' 

1 


Hence, 

cl\P T b\R — 1, 
CL 2 P + b^R = 0 , 
a 1 () T — 0, 
O 2 Q + &2*S’ = 1. 


It is easily verified that these equations are satisfied by the solution 


P = — (1 = - — P = — —P <? - — 

| M | ^ | M | | M | | M | ’ 


where 


I M | = 


a 1 b\ 

<3-2 ^2 

and is called “the determinant of the matrix M”. 


Summary 

The formula for the inverse of a 2 x 2 matrix is given by 


M -1 


1 b-2 - bi ' 

I M L- (1-2 (I\ \ : 


in which we interchange the diagonal elements of M and reverse the sign of the other two 
elements. 


4 



EXAMPLES 


1. Write down the inverse of the matrix 


M = 


5 

2 



Solution 

Since I M 


Check 


41, we have 


M" 1 


^[7 3' 

41 [-2 5 


M -1 .M 


1 

' 7 3' 


'5 -3' 

1 

[41 0 1 


'1 O' 

41 

.-2 5 


.2 7 . 

“ 41 

T— i 

o 
1 


.0 1. 


2. Use matrices to solve the simultaneous linear equations 


3 x + y = 1, 
x — 2 y = 5. 


Solution 

The equations can be written in the form MX = K, where 


M 




and K 


T 

5 


Initially, we must check that M is non-singluar by evaluating its determinant: 


M 1 = 


3 1 

1 -2 


= - 6 - 1 = - 7. 


The inverse matrix is now given by 

M" 1 = 


i r — 2 -l 
7 l-l 3 


Thus, the solution of the simultaneous equations is given by 


' x' 

l 

'-2 -r 


'1' 

1 

'-r 


' 1 ' 

.y. 

^7 

.-1 3 . 


.5. 

~ ~ 7 

. 14 _ 


.-2. 


That is, 


x — l and y = —2. 
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Note: 

Example 2 should be regarded as a model solution to this type of problem but the student 
should not expect every exercise to yield whole number answers. 

(b) The inverse of a 3 x 3 Matrix 

The most convenient version of Cramer’s rule to use here may be stated as follows: 

The simultaneous linear equations 

a\x + biy + c\z = k±, 
a 2 x + b 2 y + c 2 z = k 2 , 
a 3 x + b 3 y + c 3 z = k 3 

have the solution 



x 




y 




2: 




1 


k\ 

bi 

Cl 


CL\ 

k\ 

Cl 


ai 

bi 

k\ 



bi 

Cl 

k 2 

b 2 

c 2 


d2 

k 2 

c 2 


a 2 

b 2 

k 2 


Cl2 

b 2 

c 2 

k 3 

h 

c 3 


a 3 

k 3 

c 3 


a 3 

b 3 

k 3 


a 3 

b 3 

c 3 


METHOD 

(i) We observe, first, that the last determinant in the Cramer’s Rule formula contains the 
same arrangement of numbers as the matrix M; as in (a), we call it “the determinant of 
the matrix M” and we denote it by j M |. 

In this determinant, we let the symbols A 3 , A 2 , A 3 , B i, B- 2l B 3 , C \ , C 2 and C 3 denote 
the “cofactors” (or “signed minors”) of the elements aq, a 2 , a 3 , b±, b 2 , b 3 , Ci, c 2 and c 3 
respectively. 

(ii) We now observe that, for each of the elements k L , k 2 and k 3 in the first three determinants 
of Cramer’s Rule, the cofactor is the same as the cofactor of one of the elements in the final 
determinant | M |. 

(iii) Expanding each of the first three determinants in Cramer’s Rule along the column of 
h-values, the solutions for x , y and z can be written as follows: 

X = | M | ^ ^ 2^2 + k 3 A 3 ) ; 
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1 


y = 

z = 


M 

1 

M 


{k\Bi + /C2-B2 + ; 


( k\C\ + k 2 C 2 + k^C ^) ; 


or, in matrix format, 


x 

y 

z 


1 

to 

CO 


k\ 

B\ B 2 B3 


k 2 

1 

0 

S 

P 

1 


1 

co 

1 


Comparing this statement with the statement 

X = Mr 1 K, 

we conclude that 

Ai A2 A3 
B\ B2 B3 

cj c 2 c 3 



Summary 

Since it is known that Cramer’s Rule is applicable to any number of equations in the same 
number of unknowns, similar working would occur for larger or smaller systems of equations. 
In general, the inverse of a square matrix is the transpose of the matrix of cofactors 
times the reciprocal of the determinant of the matrix. 

Notes: 

(i) If it should happen that | M | = 0, then the matrix M does not have an inverse and is said 
to be “singular”. In all other cases, it is said to be “non-singular”. 

(ii) The transpose of the matrix of cofactors is called the “adjoint” of M, denoted by AdjM. 
There is always an adjoint though not always an inverse; but, when the inverse does exist, 
we can say that 

M -1 = — ^AdjM. 
j M | J 

(iii) The inverse of a matrix of order 2x2 fits the above scheme also. The cofactor of each 
element will be a single number associated with a “place-sign” according to the following 
pattern: 


+ - 
- + ' 
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Hence, if 


then, 


M 


~«i b \ " 

,a 2 M ’ 


M _1 


1 

ai &2 ~~ 02^1 




As before, we see that the matrix part of the result can be obtained by interchanging the 
diagonal elements of M and reversing the signs of the other two elements. 

EXAMPLE 


Use matrices to solve the simultaneous linear equations 


3 x + y — z = 1, 
x — 2y + z = 0, 
2x + 2y + z = 13. 


Solution 

The equations can be written in the form MX = K, where 

'3 1 -il \x] [ 1 ' 

M = 1 — 2 1 X = y and K = 0 

2 2 1 J [l3 

Initially, we must check that M is non-singluar by evaluating its determinant: 

3 1 -1' 

| M |= 1 -2 1 = 3(— 2 — 2) — 1(1 — 2) + (— 1)(2 + 4) = —17. 

2 2 1 

If C denotes the matrix of cofactors, then 

HI 1 0 

U - -3 E -4 . 

ED — 4 E3 
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Notes: 

(i) The framed elements indicate those for which the place sign is positive and hence no 
changes of sign are required to convert the minor of the corresponding elements of M into 
their cofactor. It is a good idea to work these out first. 

(ii) The remaining four elements are those for which the place sign is negative and so the 
cofactor is minus the value of the minor in these cases. If these, too, are worked out together, 
there is less scope for making mistakes. 

(iii) It is important to remember that, in finding the elements of C, we do not multiply 
the cofactors of the elements in M by the elements themselves. 

The transpose of C now provides the adjoint matrix and hence the inverse as follows: 


M" 1 





-1 

-4 

-7 


Thus, the solution of the simultaneous equations is given by 


X 

y 

1 

'-4 

1 

-3 

5 

-1' 

-4 


' 1 ' 
0 

1 

1 1 
Oi 

i- 1 -a 


V 

3 

z 

-17 

6 

-4 

-7 


13 

~ -17 

-85 


5 


That is, 


x = 1 


y = 3 and z — 5. 


Note: 

Once again, the example should be regarded as a model solution to this type of problem but 
the student should not expect every exercise to yield whole number answers. 

9.3.5 EXERCISES 

1. Determine, where possible, the multiplicative inverses of the following matrices: 


\ 

'1 5' 


'-2 1' 


'-1 2' 

a) 

.7 9. 

; 0) 

.-6 3. 

; (c) 

1 2. 



14-2 


1 

a> 

CO 
1 


11 3 5 

(d) 

-3 2 -7 
1 0 5 

;(e) 

0-5 2 
114 

; (f) 

6 2 2 
10 3 4 
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2. Use a matrix inverse method to solve the following sets of simultaneous linear equations: 


(a) 

4x — 3y = 9, 
3a; — 2 y = 7; 

(b) 

x + 2y = —2, 
5 x — 4y = 3; 


3x + 2y + z = 3, 
5x + 4y + 3z = 3, 
6 x + y + z = 5; 

(d) 

x + 2y + 2z = 0, 
2x + by + 4z = 2, 
x — y — 6z = 4; 


3x + 2y — 2z = 16, 

4x + 3y + 3z = 2' 

2x — y + z = —1; 

(f) 

2 (i 3 - u) + 5(i 3 - h) = 4, 
(*2 — * 3 ) + 2^2 + (i 2 — *l) = 0, 

5(U — * 3 ) + 2(ii — i 2 ) + 12 = 1- 
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'4 0 ll T-l 2 1 ' 

3. If A = 2 1 3 and C = 3 1 -4 , 

— 1 —2 5j [ 0 5 -2 

determine a matrix, B, such that A.B = C. 


9.3.6 ANSWERS TO EXERCISES 


1. (a) - 2 ' 6 _° 7 x 5 ; (b) No Inverse, (c) \ ; 

'10 -20 -24] [ —22 -11 11' 

(d) A 8 7 13 ; (e) — Aj- 2 23 —12 ; (f) There is no inverse. 

[-2 4 14 J [ 5 “3 -3° 

2. (a) x — 3 and y — 1; (b) x — ^ and y — — 1|; (c) x — 1, y — 1 and z = —2; 

(d) x = — |, y = 2 and z = — (e) x — 2, y — | and z 

(f) A = A, j 2 = 1 and i 3 = 

3. 

x r 11 -2 -1 ] [-1 2 1 ] 1 r —17 15 21 ' 

B — — —13 21 —10 . 3 1 -4 = — 76 -55 -77 . 

41 [ -3 8 4 J [ 0 5 -2J 41 [ 27 22 -43 
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UNIT 9.4 - MATRICES 4 - ROW OPERATIONS 


9.4.1 MATRIX INVERSES BY ROW OPERATIONS 

In this section, we shall examine an alternative method for hireling the multiplicative inverse 
of a matrix but the techniques introduced will lead on to other aspects of solving simultaneous 
linear equations not discussed in earlier units. 

DEFINITION 

An “elementary row operation” on a matrix is any one of the following three possibilities: 

(a) The interchange of two rows; 

(b) The multiplication of the elements in any row by a non-zero number; 

(c) The addition of the elements in any row to the corresponding elements in another row. 

Notes: 

(i) Elementary row operations are essentially the same kind of operations as those used in 
the solution of a set of simultaneous linear equations by the method of elimination; but here 
we shall be considering sets of coefficients in the form of matrices rather than complete sets 
of equations. 

(ii) Elementary row operations of types (b) and (c) imply that the elements in any row 
may be subtracted from the corresponding elements in another row and, more generally, 
multiples of the elements in any row may be added to or subtracted from the corresponding 
elements in another row. 

RESULT 1. 

To perform an elementary row operation on a matrix algebraically, we may pre-multiply 
the matrix by an identity matrix on which the same elementary row operation has been 
already performed. For example, in the matrix 

a\ b\ 

«'2 i >2 , 

_ a 3 6 3 _ 

suppose we wished to subtract twice the third row from the second row. It is easy enough to 
carry this out by inspection, but could also be regarded as the succession of two elementary 
row operations as follows: 


1 

0 

0 


1 

0 

0 


1 

1 

0 1-1 


0 1 0 


02 

0 0 1 


1 

0 

0 

LO 

1 


_ a.3 h _ 


1 



'10 o ' 


1 

rO 

cT 

1 


1 

rO 

cT 

1 

0 1 -1 


a 2 b 2 

= 

&2 — 2&3 &2 — 263 

0 0 1 


_2a 3 263 _ 


a 3 63 


DEFINITION 

An “elementary matrix” is a matrix obtained from an identity matrix by performing upon 
it one elementary row operation. 

RESULT 2. 

If a certain sequence of elementary row operations converts a given square matrix, M, into 
the corresponding identity matrix, then the same sequence of elementary row operations in 
the same order will convert the identity matrix into M _1 . 

Proof: 


Suppose that 


E n .E n _i....E4.E 3 .E 2 .Ei.M — I 
where Ei, E 2 , E 3 , E A , E n _i, En are elementary matrices. 

Then, by post-multiplying both sides with M -1 , we obtain 

E n .E n _ 1 ....E A .E 3 .E 2 .E 1 MMT 1 = I.M -1 . 

In other words, 


E n . E n _ i — E 4 . E 3 . E 2 . E \ . I — M 1 , 


which proves the result. 


EXAMPLES 


1. Use elementary row operations to determine the inverse of the matrix 

'3 T 
2 5 ' 


Solution 

First we write down the given matrix side-by-side with the corresponding identity matrix 
in the following format: 

3 7!l0 
2 5i0l‘ 
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Secondly, we try to arrange that the first element in the first column of this arrangement 
is 1; and this can be carried out by subtracting the second row from the first row. 
(Instruction: Ri — > R\ — R 2 ). 


12il-l 

2 5 : o 1 ' 

Thirdly, we try to convert the first column of the display into the first column of the 
identity matrix; and this can be carried out by subtracting twice the first row from the 
second row. 

(Instruction: R 2 — > R 2 — 2Ri)- 

12: 1 -1 
_0 1 i -2 3 

Lastly, we try to convert the second column of the display into the second column of 
the identity matrix: and this can be carried out by subtracting twice the second row 
from the first row. 

(Instruction: Ri — > R\ — 2 R 2 ). 

10: 5 -7 

_0 1 i -2 3 

The inverse matrix is therefore 

' 5 -7' 

_-2 3 J ’ 

as we would have obtained by the cofactor method. 

Notes: 

(i) The technique used for a 2 x 2 matrix applies to square matrices of all orders with 
appropriate modifications. 

(ii) The idea of the method is to obtain, in chronological order, the columns of the 
identity matrix from the columns of the given matrix. We do this by using elementary 
row operations to convert each diagonal element in the given matrix to 1 and then using 
multiples of 1 to reduce the remaining elements in the same column to zero. 

(iii) Once any row has been used to reduce elements to zero, that row must not be used 
again as the operator; otherwise the zeros obtained may change to other values. 
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2. Use elementary row operations to show that the matrix 


has no inverse. 

Solution 

We set np the scheme in the following format: 


13:10 
2 6:01 


Then, we proceed according to the instructions indicated: 
R'2 — * R 2 — 2 R\ 


There is no way now of continuing to convert the given matrix into the identity matrix; 
hence, there is no inverse. 

3. Use elementary row operations to determine the inverse of the matrix 

'4 1 6' 

2 13 . 

3 2 5 


Solution 

We set up the scheme in the following format: 


1 

1 

6 

1 

0 

1 

0 

2 

1 

3 

0 

1 

0 

.3 

2 

5 

0 

0 

1_ 


Then, we proceed according to the instructions indicated: 

R\ R\ — R'i 


1 

-1 

1 

1 

0 

1 

2 

1 

3 

0 

1 

0 

.3 

2 

5 

0 

0 

1. 



FI -2 — * R '2 — 2 R\ and R% — > R% — 3 R\ 


i -i i ; i o-i 

0 3 1 ; -2 1 2 

.0 5 2 : -3 0 4 

R-2 —■ > 2i?2 — -R 3 

1 - 11 : 1 0-1 

0 1 0 1-12 0 

.0 5 2 1-30 4 

R\ — > i?i -|- R2 and R3 — > R3 — 5/?2 


’1 

0 

1 1 

0 

2 

-l" 

0 

1 

0 1 

-1 

2 

0 

.0 

0 

2 1 

2 

-10 

4 . 


i?3 — > i?3 x | 


1 

0 

1 1 

0 

2 

1 

1 

0 

1 

0 1 

-1 

2 

0 

1 

O 

0 

1 1 

1 

-5 

2 . 


i?l — > i?l — i ? 3 

10 0 1—17 - 3 ’ 
0101-12 0 
.0 0 11 1 5 2 . 

The required inverse matrix is therefore 

'-1 7 - 3 ' 

-12 0 . 
1-5 2 
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9.4.2 GAUSSIAN ELIMINATION - THE ELEMENTARY VERSION 


Elementary row operations can also be conveniently used in another method of solving 
simultaneous linear equations which relates closely again to the elimination method some- 
times encountered in courses which do not include matrices. The method will be introduced 
through the case of three equations in three unkowns, but may be applied in other cases as 
well. 

Suppose a set of simultaneous linear equations in the variables x, y and z appeared in the 
special form: 

a\x + biy + c\z = k±, 
b 2 y + c 2 z = k 2 , 
c 3 z = k 3 . 

Then it is very simple to solve the equations by first obtaining z from the third equation, 
substituting its value into the second equation in order to find y, then substituting for both 
y and z in the first equation in order to find x. 

The method of Gaussian Elimination reduces any set of linear equations to this triangular 
form by adding or subtracting suitable multiples of pairs of the equations; but the method 
is more conveniently laid out in a tabular form using only the coefficients of the variables 
and the constant terms. We illustrate with an example. 

EXAMPLE 

Solve the simultaneous linear equations 

2x + y + z = 3, 
x — 2y — z = 2, 

3 x — y + z = 8. 

Solution 

For the simplest arithmetic, we try to arrange that the first coefficient in the first equation 
is 1. In the current example, we could interchange the first two equations. 

0-2-12 
2 113 

3-118 
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This format is known as an “augmented matrix”. The matrix of coefficients has been 
augmented by the matrix of constant terms. 

Using the notation of the previous section, we apply the instructions R 2 — > ► R- 2 — 2 Ri and 
i? 3 — > i? 3 — 3 R | giving a new table, namely 


1 -2 -1 

3 

4 


0 [5 
0 5 


2 

-1 

2 


Next, we apply the instruction R 3 — » i? 3 — R 2 giving 


1 -2 -1 
0 5 3 

0 0 1 


2 

-1 

3 


The numbers enclosed in the boxes are called the “pivot elements” and are used to reduce 
to zero the elements below them in the same column. 


The final table above provides a new set of equations, equivalent to the original, namely 


x — 2y — z = 2, 
by + 3z = — 1, 
z = 3. 


Hence, 


z — 3, y = —2, x — 1 . 


INSERTING A CHECK COLUMN 


In the above example, the numbers were fairly simple, giving little scope for careless mistakes. 
However, with large numbers of equations, often involving awkward decimal quantities, the 
margin for error is greatly increased. 

As a check on the arithmetic at each stage, we deliberately introduce some additional 
arithmetic which has to remain consistent with the calculations already being carried out. 
The method is to add together the numbers in each row in order to produce an extra column; 
each row operation is then performed on the extended rows with the result that, in the new 
table, the final column should still be the sum of the numbers to the left of it. 

The working for our previous example would be as follows: 
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[T] -2-12 0 

2 113 7 

3 -1 1 8 11 

i?2 — * R‘2 — 2 R\, i?3 — ■> i?3 — 3-Ri 

1 - 2-12 0 

0 0 3 -17 

0 5 4 2 11 

i?3 — > i?3 — R 2 

1 - 2-12 0 

0 5 3 -1 7 

0 0 13 4 

9.4.4 EXERCISES 


1. Use elementary row operations to determine (where possible) the inverses 
of the following matrices: 



'1 2 - 1 ' 

(d) 2 0-7; 

3 11 3 

'0 3 -2' 

(e) 1-15; 
15 1 _ 

'2 -3 4 

(f) -13-4; 

10 2 


[-8 -7 -6 -51 



[ 5 6 7 8 J 



2. Use Gaussian Elimination (with a check column) to solve the following sets of simulta- 
neous linear equations: 

(a) 

x + 3y = —8, 

5a; -2 y = 11; 

(b) 

x + 2y = —2, 

5a; — Ay = 3; 


2 x — y + z = 7, 
3x + y — 5z = 13, 
x + y + z = 5; 

(d) 

3a; + 2y — 2z = 16, 
4x + 3y + 3z = 2, 
2x — y + z = — 1. 
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9.4.5 ANSWERS TO EXERCISES 


, n fi oi 


[- 1 2 1. 
(b) j } 




(c) 3 2 \ ; 


.2 _ 


' 77 -17 

-14 

(d) -27 6 

5 

22 -5 

-4 


(e) There is no inverse; 



(g) There is no inverse. 

2. (a) x — 1 and y = —3; 

(b) x= and y = -gj. 

(c) x — 4, y — 1 and z = 0; 

(d) x = 2, y — | and z = 
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UNIT 9.5 - MATRICES 5 - CONSISTENCY AND RANK 


9.5.1 THE CONSISTENCY OF SIMULTANEOUS LINEAR EQUATIONS 
Introduction 

Methods of solving simultaneous linear equations in earlier Units have already shown that 
some sets of equations cannot be solved to give a unique solution. The Gaussian Elimination 
method described in Unit 9.4 is able to detect such situations as illustrated in the work below: 

ILLUSTRATION 1. 

Suppose, firstly, that we were required to investigate the solution of the simultaneous linear 
equations 

3 x — y + z = 1, 

2x + 2y — 5z = 0, 

5a; + y — Az = 7. 

The Gaussian Elimination solution, with check column, proceeds as follows 

3-11 14 

2 2 -5 0 -1 

51 -479 

R\ — * R\ — 7?2 

1-3 6 15 

2 2 -5 0 -1 

5 1 -4 7 9 

i ?2 — * 7?2 — 2 Ri and i? 3 — ■> R 3 — 5i?i 

1-3 6 15 

0 8 -17 -2 -11 

0 16 -34 2 -16 

-R 3 R 3 — 2 T ?2 

1-3 6 15 

0 8 -17 -2 -11 

0 0 0 6 6 

The third line seems to imply that O.z = 6; that is, 0 = 6 which is impossible. 
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Hence the equations have no solution and are said to be “inconsistent” . 

ILLUSTRATION 2. 

Secondly, let us consider the simultaneous linear equations 

3 x — y + z = 1, 

2x + 2y — 5z = 0, 
hx + y — Az = 1, 

which differ from the first illustration in one number only, the constant term of the third 
equation; but the conclusion will be very different. 

This time, the Gaussian Elimination method gives the following sequence of steps: 

3-11 14 

2 2 -5 0 -1 

51 -413 

Ri — ■> R\ — i?2 

1-3 6 15 

2 2 -5 0 -1 

5 1 -4 13 

i ?2 — * R 2 ~~ 2-Ri and i? 3 — > R 3 — 5Ri 

1-3 6 15 

0 8 -17 -2 -11 

0 16 -34 -4 -22 

R 3 —> R 3 — 2i?2 

1-3 6 15 

0 8 -17 -2 -11 

0 0 0 0 0 

The third line here implies that the third equation is redundant since any set of x, y and z 
values would satisfy it. Hence, the original set of equations is equivalent to two equations in 
three unknowns, namely 
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x — 3y + 6z 
8 y — 17 z 


1 , 

- 2 . 


To state a suitable form of the conclusion, we could use the fact that any one of the three 
variables may be chosen at random, the other two being expressed in terms of it. For 
instance, if we choose z at random, then 

3^ + 2 , 17z — 2 

x = — - — and y = . 


However, there is a neater way of arriving at the conclusion. 

Neater form of solution 

Suppose that x = xq, y = yo, z = zo is any known solution to the equations. It could be 
determined, for instance, by starting with z — 0; in this case, z — 0, y — — | and x — 

Let us now substitute 


x = Xi+X 0 , 

y = yi + yo, 

z = Z 1 + Zo, 


from which we obtain 


(aq + xo) — 3(2/i + yo) + 6(zi + zq) = 1, 

8(l/i + Vo) — 17(^i + z 0 ) = —2. 


But, because (xo,yo,zo) is a known solution, this reduces to 


Xi — 3ij] + 6zi = 0, 
8yi - 17^i = 0. 
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This is a set of “homogeneous linear equations” and, although clearly satisfied by ay = 0, 
yi — 0, Z\ — 0, we regard this as a “trivial solution” and ignore it. 

Of more use to us is the fact that an infinite number of non-trivial solutions can be found 
for each of which the variables ay , ay and Z\ are in a certain set of ratios. In the present 
case, from the second equation, we have 


ij\ = y z i which means that y\ : z\ — 17 : 8. 

Substituting into the first equation gives 

Xi — 5 g ' z\ + 6z± = 0 which means that ay = | 2 y; that is, ay : z\ — 3 : 8. 
Combining these conclusions, we can say that 


ay : a/i : Z\ — 3 : 17 : 8 

and any three numbers in these ratios will serve as a set of values for ay, ij\ and z\. 
The neater form of solution can, in general, be written 


X 


x 0 



y 

— 

Vo 

+ Cl 

y\ 

z 


_ z 0_ 


_ z i_ 


where a may be any non-zero number. 
In our present example, 


X 


- 1 - 

4 


' 3 ' 

y 

= 

1 

4 

+ ( y. 

17 

z 


_ 0 _ 


8 


Notes: 

(i) An alternative way of determining the ratios ay : ay : Z\ is to use the fact that, since at 
least one of the variables is going to be non- zero, we may begin by letting it equal 1. 


In the above illustration, suppose we let z\ — \\ then 


V\ = 


17 

¥ 


and ay 


3 

8 ' 


Hence, 


3 17 

ay : 2/i : = - : — : 1, 
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which can be rewritten 


x\ : Vi : Z! = 3 : 17 : 8, 


as before. 

(ii) Should it happen that a set of simultaneous linear equations reduces to only one equation 
(that is, each equation is just a multiple of the first) then a similar procedure can be applied 
as in the following illustration: 

ILLUSTRATION 3. 

Using trial and error, the equation 


3x — 2y + 5z = 6 

has a particular solution xq — 1, yo — 1, zq — 1, so that the general solution is given by 


X = X 0 + Xi, y = y 0 + Vl i z = Z 0 + Z 1, 


where 

3aq — 2 Vl + 5zi = 0. 

We could choose X\ = a and y± = (3 at random for this equation to give Z\ = 2 V' 1- ' . That is, 


X\ 


1 ' 


'O' 

y\ 

= a 

0 

3 

+ f3 

1 

2 

. Zi . 


. 5 - 


. 5 . 


Hence, 


X 


V 


l 


'o' 

y 

Z 

— 

1 

1 

+ CM. 

0 

3 

. 5 - 

+ 

1 

2 

. 5 . 
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9.5.2 THE ROW-ECHELON FORM OF A MATRIX 


In the Gaussian Elimination method to solve a set of simltaneous linear equations, 

MX = K, 

we begin with the augmented matrix M|K and use elementary row operations to obtain more 
zeros at the beginning of each row than at the beginning of the previous row. 

If desired, the first non-zero element in each row could be reduced to 1 by simply dividing 
that row throughout by the value of the first non- zero element. This leads to the following 
definition: 

DEFINITION 

The “row echelon form” of a matrix is that for which the first non-zero element in each 
row is 1 and occurs to the right of the first non- zero element in the previous row. 

Note: 

When using the term “row echelon form” in future, we shall not insist that the first non-zero 
element in each row has actually been reduced to 1. 

9.5.3 THE RANK OF A MATRIX 

The two illustrations of Gaussian Elimination discussed in section 9.5.1 may be used to imply 
that special conclusions are reached when, in the final row echelon form, either a complete 
row of M has reduced to zero (Illustration 1.) or a complete row of M|K has reduced to zero 
(Illustration 2.) This leads to another definition as follows: 

DEFINITION 

The “rank” of a matrix is the number or rows which do not reduce to a complete row of 
zeros when the matrix has been converted to row echelon form. 

ILLUSTRATIONS 

1. In our previous Illustration 1, M had rank 2 but M|K had rank 3. The equations were 
inconsistent. 

2. In our previous Illustration 2, M had rank 2 and M|K also had rank 2. The equations 
had an infinite number of solutions. 

3. In the examples of Unit 9.4 both M and M|K had rank 3. There was a unique solution 
to the simultaneous equations. 


A general summary of these observations may be set out as follows: 
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1. The equations MX = K are inconsistent if rank M < rank M|K. 


2. The equations MX = K have an infinite number of solutions if 
rank M = rank M|K < n where n is the number of equations. 

3. The equations MX = K have a unique solution if rank M = rank M|K = n where n is 
the number of equations. 

9.5.4 EXERCISES 

1. Use Gaussian Elimination to show that the following sets of simultaneous equations are 
inconsistent: 

(a) 

x — y + 2z = 2, 

3 x + y — z = 3, 

5x — y + 3z = 4; 

(b) 

x — y + 2z = 1, 

— x + 3 y — z = —1, 

3x — 7y + Az = 5. 


2. Determine the rank of the following matrices: 

a 1 2 ' 

-1 -3 8 

1 12 -3 ' 

4 -3 7 _ 

3. Determine the general solution of the following equations by reducing the augmented 
matrix to row echelon form: 


'1 

2 

3 4' 


0 

1 

-1 5 

;(b) 

3 

4 

11 2 







x + 3y — z = 
8x + 9y + Az = 
2x + y + 2z = 


7 


6, 

21 , 

3. 



4. State the value of t for which the matrix 


M 


2 1-3 

4 t — 6 
3 t 3 -9 


has rank 1 and determine the general solution of the system of equations 


MX = K, 


where 


X = 

X 

y 

and K = 

' 1 ' 

2 


z 


3 


5. For the system of simultaneous linear equations 

x 1 - x 2 + 2x 3 = 1, 
2xi - x 2 + tx 3 = 2, 
—xi + 2x 2 + x 3 = s, 


determine for which values of s and t there exists 

(a) no solution; 

(b) a unique solution; 

(c) an infinite number of solutions. 

Solve the equations for the two cases s — 1, t — 1 and s — — 1, t — 7. 

6. Determine the values of t for which the matrix 


M 


111 
2 1 t 

3 1 2 -2 


has rank 2. 

For each of these two values of t, solve the system of equations 

MX = K, 



X 


1 

where X = 

y 

z 

and K = 

2 

2 



9.5.5 ANSWERS TO EXERCISES 


1. The rank of the matrix of x, y and z coefficients is less than the rank of the augmented 
matrix. 


2. (a) 2; (b) 3. 


3. 

X 

y 

— 

■ 3 ■ 

5 

9 

5 

+ O' 

—7" 

4 

4. 

z 


_ 0 _ 


5 


and 



'll [ll [0 

1 + a 0 + (3 1 
1 2 1 


5. (a) t 
If s = 
If 5 = 


= 7 and s ^ —1; (b) t ^ 7; (c) t — 7 and s = — 1. 
1 and t = —1, then x\ — x 2 — | and x 3 = 


— 1 and t — 7, then 




+ cx 



o -i 


6. The rank is 2 when t = — 1 or when t — 

If t = — 1, the equations are inconsistent. 

If t = the equations have general solution 


x 1 —5 

y = 0 + a 8 

_ z] [oj [ 3 _ 
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UNIT 9.6 - MATRICES 6 


EIGENVALUES AND EIGENVECTORS 
9.6.1 THE STATEMENT OF THE PROBLEM 

Suppose A is any square matrix, and let X be a column vector with the same number of 
rows as there are columns in A. For example, if A is of order m x m, then X must be of 
order m x 1 and AX will also be of order m x 1 . 

We ask the question: 

“Is it ever possible that AX can be just a scalar multiple of X ?” 

We exclude the case when the elements of X are all zero since, in a practical application, 
these elements will usally be the components of an actual vector quantity; and, if they are 
all zero, the direction of the vector will be indeterminate. 

ILLUSTRATIONS 


'2 2 r 


i 

L- 

i 


1 

b - 

i 

13 1 


-2 

= 

-2 

12 2 


i 

CO 

1 

1 


1 

CO 

1 

1 


'2 2 r 

1 3 1 


v 

i 



'5' 

5 

= 5 

l 

1 

1 

to 

to 
1 


i 


i 

Or 

i 


l 

1 


The formal statement of the problem 

For a given square matrix, A, we investigate the existence of column vectors, X, such that 


AX = AX 


for some scalar quantity A. Each such column vector is called an “eigenvector” of the 
matrix, A; and each corresponding value of A is called an “eigenvalue” of the matrix, A. 
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Notes: 


(i) The German word “eigen 1 ' (meaning “hidden”) gives rise to the above names, but other 
alternatives are “latent values and latent vectors” or “characteristic values and characteristic 
vectors” . 


(ii) In the discussion which follows, A will be, mostly, a matrix of order 3x3, but the ideas 
involved will apply to square matrices of other orders also (see Example 1 and 
Exercises 9.6.3, question 1). 

9.6.2 THE SOLUTION OF THE PROBLEM 


Assuming that 

A = 


«1 

bi 

Cl 


x 

a 2 

b 2 

c 2 

and X = 

y 

_a 3 

b 3 

c 3 _ 


z 


the matrix equation, AX = AX, may be written out fully in the following form: 


aix + biy + C\Z = Xx, 
a 2 x + b 2 y + c 2 z = X y, 

(I 3 X + b 3 y + c 3 z = X z; 

or, on rearrangement, 

(ai — X)x + biy + c.\Z = 0, 
a 2 x + (b 2 - X )y + c 2 z = 0, 
o> 3 x + b 3 y + (c 3 - A)^ = 0, 

which is a set of homogeneous linear equations in x , y and z and may be written, for short, 
in the form 

(A — AI)X = [0], 

where I denotes the identity matrix of order 3x3. 

From the results of Unit 7.4, the condition that the three homogeneous linear equations have 
a solution other than x = 0, y = 0, z = 0 is 
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I A - AI| 


0 . 


o i — A 
0,2 
«3 


bi 

b 2 -X 




Cl 

c 2 

c 3 - A 


This equation will give, on expansion, a cubic equation in A called the “characteristic 
equation” of A. Its left-hand side is called the “characteristic polynomial” of A. 

The characteristic equation of a 3 x 3 matrix, being a cubic equation, will (in general) have 
three solutions. 

EXAMPLES 


1. Determine the eigenvalues and eigenvectors of the matrix 


A 


'2 4' 
5 3 


Solution 

(a) The eigenvalues 

The characteristic equation is given by 


0 = |A — AI| 


2 - A 4 
5 3- A 


A 2 — 5A — 14 = (A + 2)(A — 7). 


The eigenvalues are therefore A = —2 and A = 7. 

(b) The eigenvectors 
Case 1. A = — 2 

We require to solve the equation x + y = 0, 
giving x : y — — 1:1 and a corresponding eigenvector 


X = a 



5 


where a is any non-zero scalar. 

Case 2. A = 7 

We require to solve the equation 5a; — 4 y — 0, 
giving x : y = 4 : 5 and a corresponding eigenvector 
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r 4i 

x =' 3 [ 5 J’ 

where [3 is any non-zero scalar. 

2. Determine the eigenvalues and eigenvectors of the matrix 

'3 2 4' 

A = 2 0 2. 

4 2 3 

Solution 

(a) The eigenvalues 

The characteristic equation is given by 

3- A 2 4 

0 = |A — AI| = 2 -A 2 . 

4 2 3 — A 

Direct expansion of the determinant gives the equation 

-A 3 + 6A 2 + 15A + 8 = 0, 

which will factorise into 

(1 + A) 2 (8 - A) = 0. 

Note: 

Students who have studied row and column operations for determinants (see Unit 7.3) 
may obtain this by simplifying the determinant first. (One way is to subtract the third 
column from the first column and then add the third row to the first row). 

The eigenvalues are therefore A = —1 (repeated) and A = 8. 

(b) The eigenvectors 
Case 1. A = 8 

We require to solve the homogeneous equations 

—5a; + 2y + Az = 0, 

2x — 8y + 2z = 0, 

4a; + 2y — 5z = 0. 
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Eliminating x from the second and third equations gives 18 y — 9z = 0. 

Eliminating y from the second and third equations gives 18a: — 18z = 0. 

Since z appears twice in the two statements, we may try letting z = 1 to give y — \ 
and x — 1. 

Hence, 

1 

x : y : z — 1 : — : 1 = 2 : 1 : 2 
y 2 

The eigenvectors corresponding to A = 8 are thus given by 


X = a 


2 

1 

2 


where a is any non-zero scalar. 

Case 2. A = — 1 

This time, we require to solve the homogeneous equations 


4a: + 2y + Az = 0, 

2x + y + 2z = 0, 

Ax + 2y + Az = 0, 

which, of course, are all the same equation, 2x + y + 2z = 0. 

Hence, two of the variables may be chosen at random (say y = /3 and z = 7), then the 
third variable may be expressed in terms of them; (in this case x = —^/3 — 7). 

However, a neater technique is to obtain, Erst, a pair of independent particular solutions 
by setting one pair of the variables at 1 and 0, in both orders. For example y = 1 and 
z = 0 gives x = — A while y = 0 and z — 1 gives x = —1. 

The general solution (in which y and z are randomly chosen as (3 and 7 respectively) is 
then given by 


X = f3 

1 ■ 

2 

1 

+ 7 

'- 1 ' 

0 


_ 0 _ 


1 


where (3 and 7 are not both equal to zero at the same time. 
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Notes: 


(i) Similar results in Case 2 could be obtained by choosing different pairs of the three 
variables at random. 

(ii) Other special cases arise if the three homogeneous equations reduce to a single equation 
in which one or even two of the variables is absent. 

For example, if they reduced to y — 0, the corresponding eigenvectors could be given by 



V 


'o' 

X = a 

0 

+ 7 

0 


0 


1 


allowing x = a and z = 7 to be chosen at random, assuming that a and 7 are not both zero 
simultaneously. 

Alternatively, if the homogeneous equations reduced to 3a; + 5z = 0, then the corresponding 
eigenvectors could be given by 



1 ' 


'o' 

X = a 

0 

3 

. 5 - 

+ 

1 

0 


allowing x = a and y — (3 to be chosen at random, assuming that a and /3 are not both zero 
simultaneously. 

9.6.3 EXERCISES 

Determine the eigenvalues and eigenvectors of the following matrices: 


1 3' 

2 6 


10-1 
12 1 
2 2 3 
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3. 


11-2 
-12 1 
0 1-1 

4. 

'1 2 2' 

0 2 1 . 
-12 2 

5. 

'0 1 O' 

0 0 1 . 
1 -3 3 

6 . 

'-2 -8 -12 
1 4 4 

0 0 1 

7. 

'-3 -9 -12 
13 4 

0 0 1 

8 . 

'2 2 1' 
13 1 . 
12 2 

9. 

'2 0 O' 

3 2 0. 

4 0 2 

10 . 

'4 -2 O' 
110 . 
1-2 3 
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9.6.4 ANSWERS TO EXERCISES 


The eigenvectors will be written as X T instead of X, in order to avoid unnecessary waste 
space. 

1. A = 0, giving X 1 = a [—3 1], 
where a is any non-zero scalar; 

A = 7, giving X T = (3[1 2], 
where f3 is any non-zero scalar. 

2. A = 1, giving X T = a[l - 1 0], 
where a is any non-zero scalar; 

A = 2, giving X T = /3[-2 1 2], 
where f3 is any non-zero scalar; 

A = 3, giving X T = 7 [-l 1 2], 
where 7 is any non-zero scalar. 

3. A — 1 . giving X 1 = a [3 2 1], 
where a is any non-zero scalar; 

A = -1, giving X T = (3[l 0 1], 
where f3 is any non-zero scalar; 

A = 2, giving X T = 7 [1 3 1], 
where 7 is any non-zero scalar. 

4. A — 1. giving X 1 = a[ 1 1 — 1], 
where a is any non-zero scalar; 

A = 2 (repeated), giving X T = (3[2 1 0], 
where /3 is any non-zero scalar. 

5. A = 1 (repeated), giving X T — a[ 1 1 1], 
where a is any non-zero scalar. 

6. A = 0, giving X 1 = a [— 4 1 0], 
where a is any non-zero scalar; 

A = 2, giving X T = (3[-2 1 0], 
where f3 is any non-zero scalar; 

A = -1, giving X T = y[28 - 8 3], 
where 7 is any non-zero scalar. 



7. A = 1, giving X T = a[- 12 4 1], 
where a is any non-zero scalar; 

A = 0 (repeated), giving X T = (3[— 3 10]+ y[— 4 0 1], 

where /3 and 7 are any scalar numbers which are not simultaneously zero. 

8. A = 5, giving X T — a[ 1 1 1], 
where a is any non-zero scalar; 

A = 1 (repeated), giving X T = (3[— 2 1 0] + y[— 1 0 1], 

where /3 and 7 are any scalar numbers which are not simultaneously zero. 

9. A = 2 (repeated), giving X T = /3[0 1 0] + y[0 0 1] 

where f3 and 7 are any scalar numbers which are not simultaneously zero. 

10. A = 2, giving X T = a[l 1 1], 
where a is any non-zero number; 

A = 3, giving X T = a[ 1 \ 0] + y[0 0 1], 

where a and 7 are any scalar numbers which are not simultaneously zero. 
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UNIT 9.7 - MATRICES 7 

LINEARLY INDEPENDENT AND NORMALISED EIGENVECTORS 
9.7.1 LINEARLY INDEPENDENT EIGENVECTORS 

It is often useful to know if an n x n matrix, A, possesses a full set of n eigenvectors 
Xi, X 2 , X3,. . . , X n , which are “linearly independent”. 

That is, they are not connected by any relationship of the form 

aiXi + 02X2 + 03X3 + . . . =0, 

where Gq, 02, 03, . . . are constants. 

If the eigenvalues of A are distinct, it turns out that the eigenvectors are linearly independent; 
but, if any of the eigenvalues are repeated, further investigation may be necessary. 

ILLUSTRATIONS 

1 . In Unit 9 . 6 , it was shown that the matrix 

'3 2 4 ' 

A = 2 0 2 

4 2 3 

has eigenvalues A = 8 and A = —1 (repeated), with corresponding eigenvectors, 

' 2 ' 

a 1 , 

2 

where a is any non-zero scalar and 

'-II f" 1 ' 

P 1 +7 0 , 

_ 0 J [ 1 _ 

where f 3 and 7 are not both equal to zero at the same time. 
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The matrix, A, possesses a set of three linearly independent eigenvectors which may, 
conveniently, be chosen as 


X, = 

1 

LO 

, X 2 = 

'-1' 

2 

, and X 3 = 

1 


CN 


O 


. i 


It is reasonably obvious that these are linearly independent, but a formal check would 
be to show that the matrix, 

'2 -1 - 1 ' 

12 0 , 

2 0 1 

has rank 3. 

2. It may be shown that the matrix, 

'1 2 2 ' 

0 2 1 , 

-12 2 

has eigenvalues A = 2 (repeated) and A = 1, with corresponding eigenvectors, 



'2' 


1 ' 

a 

1 

and (3 

1 


0 


-1 


where a and (3 are any non-zero numbers. 

In this case, it is not possible to obtain a full set of three linearly independent eigen- 
vectors. 

9.7.2 NORMALISED EIGENVECTORS 

It is sometimes convenient to use a set of “normalised” eigenvectors, which means that, 
for each eigenvector, the sum of the squares of its elements is equal to 1. 

An eigenvector may be normalised if we multiply it by (plus or minus) the reciprocal of the 
square root of the sum of the squares of its elements. 
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ILLUSTRATIONS 


1. A set of linearly independent normalised eigenvectors for the matrix, 

'3 2 4' 

2 0 2 , 

4 2 3 



2. A set of linearly independent normalised eigenvectors for the matrix, 

'1 2 2 ' 

0 2 1 , 

-12 2 



9.7.3 EXERCISES 


1. Determine the eigenvalues and a set of linearly independent normalised eigenvectors for 
the following 2x2 matrices: 


[17 -6 1 [5 -21 ,, [ 

.45 — 16 J ’ [7 — ij : ^ 


16 -8 ' 
24 -12 ' 


2. Determine the eigenvalues and a set of linearly independent normalised eigenvectors for 
the following 3x3 matrices: 

(a) 

2-3 1 

3 13, 

-5 2 -4 
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(b) 

'7 0 4 ' 

7 0 4; 

0 0 11 


2-2 2 
111 ; 
13-1 

(d) 

'-2 0 -14' 
-7 5 -14 ; 
0 0 5 


3 0 0 
14 0 
10 4 


9.7.4 ANSWERS TO EXERCISES 


1. (a) The eigenvalues are 2 and —1. 

A set of linearly independent normalised eigenvectors is 


1 

'2' 

1 

T' 


.5. 

’ v / 10 

.3. 


(b) The eigenvalues are 3 and —2. 

A set of linearly independent normalised eigenvectors is 


1 

T 

1 

'2' 


.1. 

’ a/53 

.7. 


(c) The eigenvalues are 4 and 0. 

A set if linearly independent normalised eigenvectors is 


1 

'2' 

1 

T' 

a/13 

.3. 

’ 71 

.2. 
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2. (a) The eigenvalues are 1, 0 and —2. 

A set of linearly independent normalised eigenvectors is 


' 1 ' 

1 

' 10 ' 

2 

' 4 ' 

0 

1 

3 

and — == 

3 

-1 

’ ^230 

-11 

\/74 

-7 


(b) 


The eigenvalues 
A set of linearly 


v 


V 


'O' 

i 

± 

i 

and 

1 

i 

’ 71 

0 

0 


(c) 


The eigenvalues 
A set of linearly 


'O' 

x 

'4' 

1 

, and . 

1 

1 

’ \/66 

7 


(d) 


The eigenvalues 
A set of linearly 


1 

To 


1 

’ To 


0 

1 


and 


1 

71 


The eigenvalues 
A set of linearly 


are 4 (repeated) and 3. 
independent normalised eigenvectors is 


1 

71 


are 11, 7 and 0. 

independent normalised eigenvectors are 


1 

71 


are 2 (repeated) and —2. 
independent normalised eigenvectors is 


1 

71 


are 5 (repeated) and —2. 
independent normalised eigenvectors is 


'O' 


'O' 

x 

' 1 ' 

1 


0 

and 

-1 

0 

7 

1 

V3 

-1 
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UNIT 9.8 - MATRICES 8 


CHARACTERISTIC PROPERTIES AND 
SIMILARITY TRANSFORMATIONS 

9.8.1 PROPERTIES OF EIGENVALUES AND EIGENVECTORS 

We list, here, a number of standard properties, together with either their proofs or an 
illustration of their proofs. 

(i) The eigenvalues of a matrix are the same as those of its transpose. 

Proof: 

Given a square matrix, A, the eigenvalues of A T are the solutions of the equation 

|A T - AI| = 0. 

But, since I is a symmetric matrix, this is equivalent to 

|(A — AI) T | =0. 

The result follows since a determinant is unchanged in value when it is transposed. 

(ii) The eigenvalues of the multiplicative inverse of a matrix are the reciprocals 
of the eigenvalues of the matrix itself. 

Proof: 

If A is any eigenvalue of a square matrix, A, then 


AX = AX, 

for some column vector, X. 

Premultiplying this relationship by A -1 , we obtain 

A -1 AX = A -1 (AX) = A(A' 1 X). 
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Thus, 


A -1 X = \x. 

A 

(iii) The eigenvectors of a matrix and its multiplicative inverse are the same. 
Proof: 

This follows from the proof of (ii), since 

A -1 X = -X 
A 

implies that X is an eigenvector of A^ 1 . 

(iv) If a matrix is multiplied by a single number, the eigenvalues are multiplied 
by that number, but the eigenvectors remain the same. 

Proof: 

If A is multiplied by a, we may write the equation AX = AX in the form aAX = ctAX. 
Thus, aA has eigenvalues, aA, and eigenvectors, X. 

(v) If Ai, A2, A3, . . are the eigenvalues of the matrix A and n is a positive 

integer, then A”, A£, A3, . . are the eigenvalues of A n . 

Proof: 

If A denotes any one of the eigenvalues of the matrix, A, then AX = AX. 

Premultiplying both sides by A, we obtain A 2 X = AAX = AAA" = A 2 X 
Hence, A 2 is an eigenvlaue of A 2 . 

Similarly, A 3 X = A 3 X, and so on. 

(vi) If Ai, A2, A3, ... are the eigenvalues of the n x n matrix A, I is the n x n 

multiplicative identity matrix and k is a single number, then the eigenvalues of 
the matrix A + kl are X 1 + k, A 2 + k, A 3 + k, . . .. 
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Proof: 


If A is any eigenvalue of A, then AX = AX. 
Hence, 


(A + M)X = AX + kX = AX + kX = (A + k)X. 


(vii) A matrix is singular (|A| = 0) if and only if at least one eigenvalue is equal 
to zero. 

Proof: 

(a) If X is an eigenvector corresponding to an eigenvalue, A = 0, then AX = AX = [0]. 

From the theory of homogeneous linear equations (see Unit 7.4), it follows that |A| = 0. 

(b) Conversely, if |A| = 0, the homogeneous system AX = [0] has a solution for X other than 
X = [0]. Hence, at least one eigenvalue must be zero. 

(viii) If A is an orthogonal matrix (AA T = I), then every eigenvalue is either +1 
or —1. 

Proof: 

The statement AA T = I can be written A -1 = A T so that, by (i) and (ii), the eigenvalues 
of A are equal to their own reciprocals. 

That is, they must have values +1 or —1. 

(ix) If the elements of a matrix below the leading diagonal or the elements 
above the leading diagonal are all equal zero, then the eigenvalues are equal to 
the diagonal elements. 

ILLUSTRATION 

An “upper-triangular matrix”, A, of order 3x3, has the form 


A 


a i b\ Ci 

0 b- 2 c 2 

0 0 c 3 
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The characteristic equation is given by 


0= |A — AI| 


a i — A b\ Ci 
0 b - 2 - A c 2 

0 0 C3 — A 


(on — A) (62 — A)(c3 — A). 


Hence, A = a 1, b 2 or c 3 . 

A similar proof holds for a “lower-triangular matrix” . 

Note: 

A special case of both a lower-triangular matrix and an upper-triangular matrix is a diagonal 
matrix. 

(x) The sum of the eigenvalues of a matrix is equal to the trace of the matrix 
(the sum of the diagonal elements) and the product of the eigenvalues is equal 
to the determinant of the matrix. 

ILLUSTRATION 

We consider the case of a 2 x 2 matrix, A, given by 


A 


a 1 b\ 
a 2 b 2 


The characteristic equation is 


0 


a\ — A bi 
a 2 b 2 — A 


A^ — (oq + b 2 ) A + (aib 2 — a 2 b\). 


But, for any quadratic equation, aX 2 + b\ + c = 0, the sum of the solutions is equal to —b/a 
and the product of the solutions is equal to c/a. 

In this case, therefore, the sum of the solutions is a\ + b 2 , while the product of the solutions 
is a\b 2 — a 2 b\. 
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9.8.2 SIMILAR MATRICES 


In the previous section, a matrix and its transpose illustrated how two matrices can have 
the same eigenvalues. In this section, we deal with a more general case of this occurrence. 

DEFINITION 

Two matrices, A and B, are said to be “similar” if 

B = P^AP, 


for some non-singular matrix, P. 

Notes: 

(i) P is certainly square, so that A and B must also be square and of the same order as P. 

(ii) The relationship B = P _1 AP is regarded as a “transformation” of the matrix, A, into 
the matrix, B. 

(iii) A relationship of the form B = QAQ -1 may also be regarded as a similarity transfor- 
mation on A, since Q is the multiplicative inverse of Q _1 . 

THEOREM 

Two similar matrices, A and B, have the same eigenvalues. Furthermore, if the similarity 
transformation from A to B is B = P _1 AP, then the eigenvectors, X and Y, of A and B 
respectively are related by the equation 


Y = P _1 X. 


Proof: 

The eigenvalues, A, and the eigenvectors, X, of A satisfy the relationship AX = AX. 
Hence, 


P _1 AX = AP _1 AA 


Secondly, using the fact that PP 1 = I, we have 
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P^APP^X = AP _1 X, 


which may be written 

(P' 1 AP)(P~ 1 X) = A(P _1 X) 
or 

BY = AY, 

where B = P _1 AP and Y = P _1 X. 

This shows that the eigenvalues of A are also the eigenvalues of B and that the eigenvectors 
of B are of the form P _1 X. 

Reminders 

a b' 1 _ 1 d ~b. 

_c d\ ad-bc [~c a J ’ 

and, in general, for a square matrix M, 

M -1 = ]M| X the transpose of the cofactor matrix. 

9.8.3 EXERCISES 

1. For the matrix, 

'1 -1 O' 

A = 2 4 0, 

-2 -1 1 

and its multiplicative inverse, A -1 , determine the eigenvalues and a set of corresponding 
linearly independent normalised eigenvectors. 
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2. State the eigenvalues for the upper-triangular matrix 

'2-4 1 ' 

0 3 2 

0 0 -1 

and, hence, obtain a set of linearly independent normalised eigenvectors for the matrix. 

3. State the eigenvalues of the lower-triangular matrix 

'6 0 O' 

3 0 0 

2 1 -10 


and, hence, obtain a set of linearly independent normalised eigenvectors for the matrix. 

4. Determine the eigenvalues and a set of corresponding linearly independent eigenvectors 
for the matrix B = P 1 AP , where 


A 


'1 

2 


3' 

6 


and P 


'4 r 

1 2 


5. Determine the eigenvalues and a set of corresponding linearly independent eigenvectors 
for the matrix B = P _1 AP, where 


A = 


1 

-1 O' 


'2 

-1 3' 

2 

4 0 

and P = 

0 

-2 4 

-2 

-1 1 


5 

1 6 


6. Show that the matrix 


C = 


1 

2 
1 
2 

P2 

2 


1 

2 

1 

2 

x/2 

2 


V2 

2 

_p2 

2 

0 


is orthogonal and verify that its eigenvalues are either 1 or — 1. 


9.8.4 ANSWERS TO EXERCISES 


1. 


The eigenvalues of A are 3, 2 and 1 


with corresponding normalised eigenvectors, 


1 

7! 


'-1' 

i 

'-i' 


'O' 

2 

0 

T 

’ 71 

i 

i 

and 

0 

1 
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The eigenvalues of A 1 are |, | and 1, with corresponding normalised eigenvectors the 
same as for A. 

2. The eigenvalues are 3, 2 and —1, with corresponding linearly independent normalised 
eigenvectors, 


VTf 


' 4 ' 


~r 

l 

' 2 ' 

-1 

5 

0 

and - 

2 

0 


0 

3 

-2 


3. The eigenvalues are 6, 0 and —10, with corresponding linearly independent normalised 
eigenvectors, 


'32' 

i 

' 0 ' 


'O' 

16 

1 

10 

and 

0 

5 

’ Vioi 

1 

1 


a/T305 

4. The eigenvalues of B are the same as those of A, namely 0 and 7. 

Also, P- 1 = 

Hence, a set of linearly independent eigenvectors for B is 

and 


2 -7 

-1 4 


-13 

7 


-12 

7 


5. The eigenvalues of B are the same as those of A which, from question 1, are 3, 2 and 1. 
Also, 


-16 9 2 

20 -3 -8 , 

10 -7 — 4_ 

so that a set of linearly independent eigenvectors for B are 


'-17' 


'-27' 


'- 1 ' 

13 

? 

31 

and 

4 

12 


21 
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6. CC T = 1 and the eigenvalues are 1 (repeated) and — 1. 
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UNIT 9.9 - MATRICES 9 


MODAL AND SPECTRAL MATRICES 
9.9.1 ASSUMPTIONS AND DEFINITIONS 

For convenience, we shall make, here, the following assumptions: 

(a) The n eigenvalues, Ai, A 2 , A 3 , . . . , A n , of an n x n matrix, A, are arranged in order of 
decreasing value. 

(b) Corresponding to Ai, A 2 , A 3 , . . . , A n , respectively, A possesses a full set of eigenvectors 
Xi, X 2 , X 3 , . . . , X n , which are linearly independent. 

If two eigenvalues coincide, the order of writing down the corresponding pair of eigenvectors 
will be immaterial. 

DEFINITION 1 

The square matrix obtained by using as its columns any set of linearly independent eigen- 
vectors of a matrix A is called a “modal matrix” of A, and may be denoted by M. 

Notes: 

(i) There are infinitely many modal matrices for a given matrix, A, since any multiple of an 
eigenvector is also an eigenvector. 

(ii) It is sometimes convenient to use a set of normalised eigenvectors. 

When using normalised eigenvectors, the modal matrix may be denoted by N and, for an n x n 
matrix, A, there are 2 n possibilities for N since each of the n columns has two possibilities. 

DEFINITION 2 

If Ai, A 2 , A 3 , . . . , A n are the eigenvalues of an n x n matrix, A, then the diagonal matrix, 


[A, 

0 

0 . 

. 0 ■ 

0 

a 2 

0 . 

. 0 

0 

0 

CO 

. 0 

o • 

0 

0 . 

£ 

' ^ 


is called the “spectral matrix” of A, and may be denoted by S. 
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EXAMPLE 


For the matrix, 

'1 1-2' 

A = -1 2 1 , 

0 1-1 

determine a modal matrix, a modal matrix of normalised eigenvectors and the spectral 
matrix. 

Solution 

The characteristic equation is 

1 - A 1 -2 

-1 2- A 1 =0, 

0 1 -1 - A 

which may be shown to give 

— (1 + A)(l — A)(2 — A) = 0. 

Hence, the eigenvalues are Ai = 2, A 2 = 1 and A 3 = —1 in order of decreasing value. 

Case 1. A = 2 

We solve the simultaneous equations 

—x + y — 2z = 0, 

—x + 0 y + z = 0, 

Ox + y — 3z = 0, 


which give x : y : z — 1:3:1 
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Case 2. A = 1 


We solve the simultaneous equations 


Cte + y — 2z = 0, 
— x + y + z = 0, 
Cte + y — 2z = 0, 

which give x : y : z = 3:2:1 
Case 3. A = — 1 

We solve the simultaneous equations 

2x + y — 2z = 0, 
—x + 3 y + z = 0, 
Ctr + y + CC = 0, 


which give x : y : z — 1:0:1 
A modal matrix for A may therefore be given by 


M 


1 3 1 
3 2 0 
111 


A modal matrix of normalised eigenvectors may be given by 



r 1 

3 

i -| 


WI 

Via 

V2 

N = 

3 

Vn 

2 

Via 

0 


i 

i 

i 


L dn 

Via 

V2 J 
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The spectral matrix is given by 


S 


2 0 0 

0 1 0 

0 0-1 


9.9.2 DIAGONALISATION OF A MATRIX 

Since the eigenvalues of a diagonal matrix are equal to its diagonal elements, it is clear that 
a matrix, A, and its spectral matrix, S, have the same eigenvalues. 

From the Theorem in Unit 9.8, therefore, it seems reasonable that A and S could be similar 
matrices; and this is the content of the following result which will be illustrated rather than 
proven. 

THEOREM 

The matrix, A, is similar to its spectral matrix, S, the similarity transformation being 


M _1 AM = S, 


where M is a modal matrix for A. 

ILLUSTRATION: 

Suppose that X 1; X 2 and X 3 are linearly independent eigenvectors of a 3 x 3 matrix, A, 
corresponding to eigenvalues Ai, A 2 and A 3 , respectively. 

Then, 


AXi 


AiXi, AX 2 — A 2 X 2 , and AX 3 — A 3 X 3 . 


Also, 


X\ 

X 2 

X3 

yi 

2/2 

2/3 

Zl 

Z2 

^3 


If M is premultiplied by A, we obtain a 3 x 3 matrix whose columns are AX 1? AX 2 , 


and AX 3 . 


4 



That is, 


AM = [ AXi AX 2 AX 3 ] = [AiXi A 2 X 2 A 3 X 3 ; 


X{ X 2 x 3 Ai 0 0 

AM = yi 112 IJ 3 ■ 0 A 2 0 = MS. 

_zi z 2 z 3 \ [ 0 0 A 3 _ 

We conclude that 

M _1 AM = S. 

Notes: 

(i) M^ 1 exists only because Xi, X 2 and X 3 are linearly independent. 

(ii) The similarity transformation in the above theorem reduces the matrix, A, to “diagonal 
form” or “canonical form” and the process is often referred to as the “diagonalisation” 
of the matrix, A. 

EXAMPLE 

Verify the above Theorem for the matrix 

'1 1-2' 

A = -1 2 1 . 

_ 0 1-1 

Solution 

From an earlier example, a modal matrix for A may be given by 

'1 3 r 
M = 3 2 0 . 

Ill 
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The spectral matrix is given by 


'2 0 O' 

S= 0 1 0 . 

0 0-1 

It may be shown that 

[-2 2 2 ' 
M -1 = - 3 0-3 

6 [-1 -2 7 


and, hence, 

x [-2 2 2 1 [ 1 1 -2] [1 3 1' 

M _1 AM = - 3 0 -3.-1 2 1.320 

6 [-1 -2 7 J [ 0 1 -lj [l 1 1 


l [-2 2 2 1 [2 3 -1' 

= - 3 0-3.620 

6 [-1 -2 7 J [2 1 -1 


'2 0 O' 

= 01 0 = S. 

0 0-1 
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9.9.3 EXERCISES 


1. Determine a modal matrix, M, of linearly independent eigenvectors for the matrix 

'-1 6 - 12 ' 

A = 0 -13 30 . 

0 -9 20 

Verify that M -1 AM = S, where S is the spectral matrix of A. 

2. Determine a modal matrix, M, of linearly independent eigenvectors for the matrix 

'1 2 2' 

B = 0 2 1. 

-112 

Verify that M -1 AM = S, where S is the spectral matrix of B. 

3. Determine a modal matrix, N, of linearly independent normalised eigenvectors for the 
matrix 

'2 -2 O' 

C= 0 4 0 . 

2 5 1 

Verify that N _1 AN = S, where S is the spectral matrix of C. 

4. Show that the following matrices are not similar to a diagonal matrix: 

' 1 2 2] TO 1 O' 

(a) 0 2 1 , (b) 0 0 1 . 

-1 2 2j |_1 -3 3 
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9.9.4 ANSWERS TO EXERCISES 


1. The eigenvalues are 5,2 and —1, which gives 


M 


-1 0 1 
5 2 0 

3 1 0 


2. The eigenvalues are 2, 1 and —1, which gives 


M 


1 3 1 
3 2 0 
111 


3. The eigenvalues are 4, 2 and 1, which gives 


N 


73 75 0 

A 0 0 

.75 75 1 


4. (a) The eigenvalues are 2 (repeated) and 1 but there are only two linearly independent 
eigenvectors, namely 


'2' 


1 ~ 

1 

and 

1 

0 


-1 


(b) There is only one eigenvalue, 1 (repeated), and only one linearly independent 
eigenvector, namely 
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UNIT 9.10 - MATRICES 10 


SYMMETRIC MATRICES AND QUADRATIC FORMS 
9.10.1 SYMMETRIC MATRICES 

The definition of a symmetric matrix was introduced in Unit 9.1 and matrices of this type 
have certain special properties with regard to eigenvalues and eigenvectors. We list them as 
follows: 

(i) All of the eigenvalues of a symmetric matrix are real and, hence, so are the eigenvectors. 

(ii) A symmetric matrix of order n x n always has n linearly independent eigenvectors. 

(iii) For a symmetric matrix, suppose that X, : and Xj arc linearly independent eigenvectors 
associated with different eigenvectors; then 


X,:Xj = XiXj + DiDj + ZiZj = 0 . 


We say that Xj and Xj are “mutually orthogonal” . 

If a symmetric matrix has any repeated eigenvalues, it is still possible to determine a full 
set of mutually orthogonal eigenvectors, but not every full set of eigenvectors will have the 
orthogonality property. 

(iv) A symmetric matrix always has a modal matrix whose columns are mutually orthogonal. 
When the eigenvalues are distinct, this is true for every modal matrix. 

(v) A modal matrix, N, of normalised eigenvectors is an orthogonal matrix. 

ILLUSTRATIONS 


1. If N is of order 3 x 3, we have 


n t .a = 

X\ U\ 

X2 1J2 

Zl 

Z2 


X\ 

l)\ 

X 2 X 3 
V2 V3 

— 

'1 0 o' 
0 1 0 


%3 ys 

Z 3 


Z 1 

1 

CO 

CM 


0 0 1 


1 



2 . It was shown in Unit 9.6 that the matrix 


A = 


3 2 4 
2 0 2 

4 2 3 


has eigenvalues A = 8, and A = — 1 (repeated), with associated eigenvectors 


a 


'2' 

1 

, and (3 

1 ' 

2 

1 

+ 7 

1 

O 1 
1 

= 

-\(3~1 

P 

2 


0 


1 


7 


A set of linearly independent eigenvectors may therefore be given by 



'2' 


1 _ 

2 


~-l~ 

X 1 = 

1 

2 

, x 2 = 

1 

_ 0 _ 

, and X 3 = 

0 

1 


Clearly, Xi is orthogonal to X2 and X3, but X2 and X3 are not orthogonal to each other. 
However, we may find (3 and 7 such that 


P 

1 _ 

2 

1 

+ 7 

'- 1 ' 

0 

is orthogonal to 

'- 1 ' 

0 


0 


1 


1 


We simply require that 


\p + 27 = 0 

or 

/3 + 47 = 0 ; 

and this will be so, for example, when (3 = 4 and 7 = — 1 . 
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A new set of linearly independent mutually orthogonal eigenvectors can thus be given 
by 


Xi = 

1 

bO 

, X 2 = 

'-r 

4 

, and X 3 = 

1 


CN 


1 




9.10.2 QUADRATIC FORMS 

An algebraic expression of the form 


ax 2 + by 2 + cz 2 + 2 fyz + 2 yzx + 2 hxy 


is called a “quadratic form”. 

In matrix notation, it may be written as 



a h g 


X 

x y z] 

h b f 


y 


_g f c_ 


z 


= X t AX, 


and we note that the matrix A is symmetric. 

In the scientific applications of quadratic forms, it is desirable to know whether such a form 

is 

(a) always positive, 

(b) always negative, 

(c) both positive and negative. 

It may be shown that, if we change to new variables, ( u,v,w ), using a linear transformation 


X = PU, 

where P is some non-singular matrix, then the new quadratic form has the same properties 
as the original, concerning its sign. 

We now show that a good choice for P is a modal matrix, N, of normalised, linearly inde- 
pendent, mutually orthogonal eigenvectors for A. 
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Putting X = NU, the expression X T AX becomes U T N T ANX. 

But, since N is orthogonal when A is symmetric, N T = N -1 and, hence, N T AN is the spectral 
matrix, S, for A. 

The new quadratic form is therefore 


U T SU 


1 

o 

0 

1 


u 

0 A 2 0 


V 

_ 0 0 As _ 


w 


Aim 2 + X 2 v 2 + A 3 w 2 . 


Clearly, if all of the eigenvalues are positive, then the new quadratic form is always positive; 
and, if all of the eigenvalues are negative, then the new quadratic form is always negative. 

The new quadratic form is called the “canonical form under similarity” of the original 
quadratic form. 

9.10.3 EXERCISES 


1. For the following symmetric matrices, determine a set of three linearly independent and 
mutually orthogonal eigenvectors: 


'2 

0 

O' 


'2 

2 

O' 

0 

10 

6 

t (b) 

2 

5 

0 

0 

6 

5 


0 

0 

3 


2. Repeat the previous question for the following symmetric matrices: 



3 

3 

3y/2~ 


'2 

0 0 ' 

a) 

3 

3 

3\/2 

, (b) 

0 

1 2 


?>V2 

3\/2 

6 


0 

2 -2 


3. Using the results of question 1, show that the following quadratic forms are always 
positive: 

(a) 

2x 2 + 10 y 2 + 5z 2 + 12 yz] 


(b) 


2x 2 + 5 y 2 + 3z 2 + Axy. 
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4. Using the results of question 2(b), obtain the matrix, P, of the orthogonal transforma- 
tion, X = PU, which transforms the quadratic function 

2x 2 + y 2 — 2z 2 + 4 xz 


into the quadratic function 


2 u 2 + 2v 2 — 3w 2 . 

State whether or the not the original quadratic form is always positive. 

9.10.4 ANSWERS TO EXERCISES 

1. (a) The eigenvalues are 14, 2 and 1 and a set of linearly independent mutually orthog- 
onal eigenvectors is 


'o' 

3 

1 

'l' 

0 

and 

' 0 ' 
-2 

2 


0 


3 


(b) The eigenvalues are 6, 3 and 1 and a set of linearly independent mutually orthogonal 
eigenvectors is 


'l' 


'O' 


' 2 ' 

2 

0 


0 

1 

and 

-1 

0 


2. (a) The eigenvalues are 12 and 0 (repeated) and set of linearly independent mutually 
orthogonal eigenvectors is 


1 


1 ' 


1 

1 


-1 

and 

1 

.A 


0 


.-A 


(b) The eigenvalues are 2 and —3 (repeated) and a set of linearly independent mutually 
orthogonal eigenvectors is 


'o' 


'l' 


' 0 ' 

2 

1 

0 

and 

1 

1 


0 


-2 


5 



3. The eigenvalues are all positive and hence the quadratic forms are always positive. 

4. 


P 


'01 0 ' 

_2_ n _1_ 

V5 U y/5 

4= o _a_ 

L 75 U 75-1 


The original quadratic form may take both positive and negative values since the asso- 
ciated eigenvalues are not all positive. 
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UNIT 10.1 - DIFFERENTIATION 1 


10.1.1 FUNCTIONAL NOTATION 
Introduction 

If a variable quantity, y, depends for its values on another variable quantity, x, we say that 
“y is a function of x” and we write, in general: 

V = / 0 ), 


which is pronounced “y equals / of x" . 

Notes: 


(i) y is called the “dependent variable” and x is called the “independent variable”. 

The choice of value for x will be arbitrary, within certain possible constraints, but, once it 
is chosen, the value of y is then fixed. 

(ii) The use of the letter / in f(x) is logical because it stands for the word “function”; but 
once the format of the notation is understood, we can use other letters as appropriate. For 
example: 

the statement P = P{T) could be used to indicate that a pressure, P, is a function of 
absolute temperature, T ; 

the statement i — i(t) could be used to indicate that an electric current, i, is a function of 
time t] 

the original statement could have been written y = y(x) without using / at all. 

The general format of functional notation may be described as follows: 

DEPENDENT VARIABLE = 

DEPENDENT VARIABLE (INDEPENDENT VARIABLE) 

10.1.2 NUMERICAL EVALUATION OF FUNCTIONS 

If a is a number, then /(a) denotes the value of the function f(x) when x = a is substituted 
into it. 


For example, if 
then, 



f(x) = 4sin3x, 

37T 1 

4 sin — = 4 x —= = 2.828 
4 
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10.1.3 FUNCTIONS OF A LINEAR FUNCTION 


The notation 

f(ax + b ), 

where a and b are constants, implies a known function, f(x), in which x has been replaced 
by the linear function ax + b. 


For example, if 


f(x) = 3a; 2 — 7x + 4, 


then, 


f(5x - 1) = 3(5a; - l) 2 - 7(5a; - 1) + 4; 


but, in the applications of this kind of notation, it usually best to leave the expression in the 
bracketed form rather than to expand out the brackets and so lose any obvious connection 
between f(x) and f(ax + b). 


10.1.4 COMPOSITE FUNCTIONS (or Functions of a Function) IN GENERAL 


The symbol 


f\g(x)'. 


implies a known function, f(x), in which x has been replaced by another known function, 

g(x)- 


For example, if 

and 

then, 


f(x) = x 2 + 2x — 5 
g(x) = sin a;, 

f[g(x)\ = sin 2 a; + 2 sin a; — 5; 


but we can observe also that 


g[f(x)) = sin(a; 2 + 2x — 5), 
which is not identical to the first result. Hence, in general, 

f[g{x)} # g[f(x)\. 


There are some exceptions to this, however, as in the case when 

f(x) = e x and g(x) = log e a;, 


2 



whereupon we obtain 

f[g(x)} = e loge x = x 

and 

g[f(x)} = log e (e x ) = x. 

The functions log e x and e x are said to be “inverses” of each other. 

10.1.5 INDETERMINATE FORMS 

Certain fractional expressions involving functions can become problematic if the values of 
the variable being substituted into them reduce them to either of the forms 


Both of these forms are meaningless or 
a concept called “limiting values” . 

(a) The Indeterminate Form [j 

Suppose the fractional expression 


is such that both f(x) and g(x) take the value zero when x = a; that is, /(a) = 0 and 
g(a ) = 0. It is impossible, therefore, to evaluate the fraction when x = a\ but we may 
consider its values as x becomes increasingly close to a with out actually reaching it. The 
standard terminology is to say that u x tends to a”, written x — » a, for short. 

We note that, by the Factor Theorem, discussed in Unit 1.8, ( x — a ) must be a factor of 
both numerator and denominator; and it turns out that, by cancelling this common factor 
(which is allowed if x is not going to reach a) we can assign a value to called a limiting 
value. It still will not be the value of this fraction at x — a, but represents the value it 
approaches as x tends to a. The result is denoted by 


0 oo 

- or — . 

0 oo 

“indeterminate” and need to be dealt with using 


f(x) 

g(x) 


lim 

x^a 


JM 

g(x)' 


EXAMPLE 

Calculate 

x — 1 

Inn — . 

x->i x 2 + 2x — 3 
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Solution 


First we factorise the denominator, knowing already that one of its factors must be x — 1 
because it takes the value zero when x — 1. 

The result is therefore 

r — I 

lim — - 

z-u (x — l)(x + 3) 


= lim 


x + 3 


What we now need to establish is the fixed value which this new fraction approaches as x be- 
comes increasingly close to 1. But since there are no longer any problems with indeterminate 
forms, we do in fact simply substitute x — 1, obtaining the number 


Hence, 


x 


lim „ 

x-+i x 2 + 2x — 3 


1 

4 


(b) The Indeterminate Form — 

This kind of indeterminate form is usually encountered when the value of x itself becomes 
infinite, either positively or negatively. The object is to evaluate either 


lim 

tc— >oo 


JM 

g(x) 


or 


lim 

x—>—oo 


JM 

g(x)' 


EXAMPLE 

Calculate 

, 2x 2 + 3x — 1 

Inn — . 

x^oo Yx 2 — 2x + 5 
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Solution 


There is no factorising to do in this type of exercise; we simply divide the numerator and 
the denominator by the highest power of x appearing, then allow x to become infinite. 


The result is therefore 


lim 

X — XX) y — 



xl 

JL 

X - 2 


2 

r 


Note: 

In the case of the ratio of two polynomials of equal degree, the limiting value as x — > ±oo 
will always be the ratio of the leading coefficients of x. The same principle can be applied 
to the ratio of two polynomials of unequal degree if we insert zero coefficients in appropriate 
places to consider them as being of equal degree. The results then obtained will be either 
zero or infinity. 

ILLUSTRATION 


, 5a: + 11 Oa: 2 + 5a: + 11 0 

lim — = lim = - = 0. 

x ^°° 3x 2 — 4a: + 1 x ^°° 3ar — 4a: + 1 3 

A Useful Standard Limit 

In Unit 3.3, it is shown that, for very small values of x in radians, sin a: ~ x. 
This suggests that 


, sin x 
lim = 1, 

x^O X 

though we may not actually use the result from Unit 3.3 to prove the validity of this new 
limiting value. We shall see later that “sin a: ~ x for small x” is developed from a calculus 
technique which assumes that — > 1 as x — » 0; so, we would be using the result to prove 
itself ! 

An alternative, non-rigorous, proof is to consider the following diagram in which the angle 
x is situated at the centre of a circle with radius 1: 
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c 


In the diagram, the length of line AB = sin x and the length of arc BC = x. Furthermore, as 
x decreases almost to zero, the two lengths become closer and closer to each other in value. 
That is, 

sin x — > x as x — > 0 


Sill x 

lim = 1 

x^O x 

10.1.6 EVEN AND ODD FUNCTIONS 

It is easy to see that any even power of x will be unchanged in value if x is replaced by 
— x. In a similar way, any odd power of x will be unchanged in numerical value, though 
altered in sign, if x is replaced by —x. These two powers of x are examples of an “even 
function” and an “odd function” respectively; but the true definition includes a wider 
range of functions as follows: 

DEFINITION 

A function f(x) is said to be “even” if it satisfies the identity 

f(~x) = f(x). 

ILLUSTRATIONS: x 2 , 2x 6 - 4x 2 + 5, cos*. 
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DEFINITION 

A function f(x) is aid to be “odd” if it satisfies the identity 

f(—x) = -f(x). 

ILLUSTRATIONS a: 3 , x 5 — 3a; 3 + 2x , sin a;. 

Note: 

It is not necessary for every function to be either even or odd. For example, the function 
x + 3 is neither even nor odd. 

EXAMPLE 

Express an arbitrary function, f(x), as the sum of an even function and an odd function. 

Solution 

We may write 

t , s_f( x ) + f(~ x ) , f(x)~f(~x ) 

J{ x) 2 + 2 

in which the first term on the right hand side is unchanged if x is replaced by —x and the 
second term on the right hand side is reversed in sign if x is replaced by —x. 

We have thus expressed f(x) as the sum of an even function and an odd function. 

(ii) GRAPHS OF EVEN AND ODD FUNCTIONS 


(i) The graph of the relationship y = f(x), where f(x) is even, will be symmetrical about 
the y-axis since, for every point (x,y) on the graph, there is also the point (—x,y). 
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.X 


(ii) The graph of the relationship y = f(x), where f(x) is odd, will be symmetrical with 
respect to the origin since, for every point (x, y ) on the graph, there is also the point (—x, —y). 
However, odd functions are more easily recognised by noticing that the part of the graph for 
x < 0 can be obtained from the part for x > 0 by reflecting it first in the x-axis and then in 
the y-axis. 





.x 



EXAMPLE 

Sketch the graph, from x = —3 to x = 3, of the even function, f(x), defined in the interval 
0 < x < 3 by the formula 


f(x) = 3 + x 3 . 



Solution 



ALGEBRAIC PROPERTIES OF ODD AND EVEN FUNCTIONS 

1. The product of an even function and an odd function is an odd function. 

Proof: 

If f(x) is even and g(x) is odd, then 

f(~x)-g(-x) = f(x).[-g(x)] = -f(x).g(x). 

2. The product of an even function and an even function is an even function. 

Proof: 

If f(x) and g{x) are both even functions, then 

f(-x)-g(-x) = f(x).g(x). 

3. The product of an odd function and an odd function is and even function. 

Proof: 

If f(x) and g(x) are both odd functions, then 

f(-x).g(-x) = [-f(x)].[-g(x)] = f(x).g(x). 
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EXAMPLE 


Classify the function f(x) = sin 4 x. tanx as even, odd or neither even nor odd. 

Solution 


f(—x) = sin 4 (— x). tan(— x) = sin 4 x.[— tanx] = — sin 4 x. tanx. 


The function, f(x), is therefore odd. 


10.1.7 EXERCISES 

L If 

f(x) = 3 + cos 2 (|) 
determine the values of /(0), f(n), / (|). 


2. If 

verify that 


f(x) = 2x and g(x) = x 2 , 


/[0(a)] # #L/M- 


3. If 

/(x) = x 2 — 4a; + 6, 

verify that 

/( 2-x) = f(2 + x). 

4. Determine simple functions f(x) and g{x) such that the following functions can be 
identified with f[g(x)]: 

(a) 

3(x 2 + 2) 3 ; 

(b) 

(x 2 + l) - ^; 

(c) 

cos 2 a;. 
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5. Evaluate the following limits: 


(a) 

(b) 


(d) 


x 2 + 2x — 3 

lim — : 

x ^i a; 2 — 8a: + 7 


lim — 

x—>2 x 2 


x — 2 
-(x + 2 y 


lim 

x — >oo 


3x 2 + 5a: — 4 
5a: 2 — x + 7 ’ 


lim 

r—>—oo 


(2 r + l) 2 
(r — 1 )(r + 3) ’ 


lim 


sin 3 9 
9 


6. Express the function e x as the sum of an even function and an odd function. 

7. Sketch the graph, from x = —5 to x = 5 of the odd function, f(x), defined in the 
interval 0 < x < 5 by the formula 


f(x) 


TTX 


COS 


10 ' 


8. Classify the function 


f{x) = tan 2 a: + cosec 3 a:. cos a: 
as even, odd or neither even nor odd. 

10.1.8 ANSWERS TO EXERCISES 

1. 4, 3, l 

2. 2a: 2 ^ (2a:) 2 . 
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3. Both are identically equal to x 2 + 2. 


4. (a) 






f(x) = 3x 3 

and 

(b) 






f(x) = 

and 

(c) 






f(x) = X 2 

and 

5. (a) 

2. 

3’ 



(b) 

1. 

3’ 



(c) 

3. 

5’ 



(d) 

4; 



(e) 

3. 




g(x) = x 2 + 2 


g(x) = x 2 -\-l 


g(x) = cos a;. 


6 . 


gi| e I e x _ g X 

6 = + 

2 2 

7. The graph is as follows: 



8. The function is neither even nor odd. 
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UNIT 10.2 - DIFFERENTIATION 2 


RATES OF CHANGE 

10.2.1 INTRODUCTION 

The functional relationship 

V = f{x) 

can be represented diagramatically by drawing the graph of y against x to obtain, in general, 
some kind of curve. 

Between one point of the curve and another, the values of both x and y will change, in 
general; and the purpose of this section is to introduce the concept of the rate of increase 
of y with respect to x. 

A convenient practical illustration which will provide an aid to understanding is to think of 
y as the distance travelled by a moving object at time x; because, in this case, the rate of 
increase of y with respect to x becomes the familiar quantity which we know as speed. 

10.2.2 AVERAGE RATES OF CHANGE 

Suppose that a vehicle travelled a distance of 280 miles in 7 hours, a journey which is likely to 
have included short stops, traffic jams, traffic lights and also some fairly high speed motoring. 
The ratio 

280 _ 

~T ~ 

represents the “average speed” of 40 miles per hour over the whole journey. It is a 
convenient representation of the speed during the journey even though the vehicle might not 
have been travelling at that speed very often. 

Consider now a graph representing the relationship, y = f(x), between two arbitrary 
variables, x and y, not necessarily time and distance variables. 


1 



Q(c, d) 


V 



§ 


x 


Between the two points P(a, b ) and Q(c, d ) an increase of c — a in x gives rise to an increase 
of d — b in y. Therefore, the average rate of increase of y with respect to x from P to Q is 

d-b 
c — a 


ff it should happen that y decreases as x increases (between P and Q), this quantity will 
automatically turn out negative; hence, 

all rates of increase which are POSITIVE correspond to 
an INCREASING function, 

and 

all rates of increase which are NEGATIVE correspond to 
a DECREASING function. 

Note: 

For the purposes of later work, the two points P and Q will need to be considered as very 
close together on the graph, and another way of expressing a rate of increase is to consider 
notations such as P(a:,|/) and Q(x + Sx,y + Sy) for the pair of points. 

Here, we are using the symbols Sx and Sy to represent “a small fraction of x” and “a 
small fraction of y”, respectively. We do not mean 6 times x and <5 times y. We normally 
consider that Sx is positive, but Sy may turn out to be negative. 

The average rate of increase in this alternative notation is given by 

Sy_ = f(x + Sx) - f(x) 

Sx Sx 
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In other words, 

The average rate of increase is equal to 

(new value of y) minus (old value of y) 

(new value of x) minus (old value of x ) 


EXAMPLE 

Determine the average rate of increase of the function 

y = x 2 

between the following pairs of points on its graph: 

(a) (3,9) and (3.3,10.89); 

(b) (3,9) and (3.2,10.24); 

(c) (3,9) and (3.1,9.61). 

Solution 

The results are 

M I = T§ = 6-3; 

(b) 5 £ = if = 6-2; 

M £ = ^ = 6.1 

10.2.3 INSTANTANEOUS RATES OF CHANGE 

The results of the example at the end of the previous section seem to suggest that, by- 
letting the second point become increasingly close to the first point along the curve, we 
could determine the actual rate of increase of y with respect to x at the first point only, 
rather than the average rate of increase between the two points. 

In the above example, the indications are that the rate of increase of y = x 2 with respect to 
x at the point (3, 9) is equal to 6; and this is called the “instantaneous rate of increase 
of y with respect to x v at the chosen point. 

The instantaneous rate of increase in this example has been obtained by guesswork on the 
strength of just three points approaching (3,9). In general, we need to consider a limiting 
process in which an infinite number of points approach the chosen one along the curve. 
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This process is represented by 


5 a :— >0 5x 

and it forms the basis of our main discussion on differential calculus which now follows. 

10.2.4 DERIVATIVES 

(a) The Definition of a Derivative 

In the functional relationship 

V = f{x) 

the “derivative of y with respect to x n at any point (x, y ) on the graph of the function 
is defined to be the instantaneous rate of increase of y with respect to x at that point. 

Assuming that a small increase of 5x in x gives rise to a corresponding increase (positive or 
negative) of 5y in y, the derivative will be given by 


,. Sy 
hm — 

6x^0 OX 


lim 

>0 


f(x + Sx) - f(x) 
8x 


This limiting value is usually denoted by one of the three symbols 


d y 

dx’ 


f(x) or 


d 

dx 


L/M- 


Notes: 

(i) In the third of these notations, the symbol ^ is called a “differential operator”; it 
cannot exist on its own, but needs to be operating on some function of x. In fact, the first 
alternative notation is really this differential operator operating on y, which we certainly 
know to be a function of x. 


(ii) The second and third alternative notations are normally used when the derivative of a 
function of x is being considered without reference to a second variable, y. 

(iii) The derivative of a constant function must be zero since the rate of change of something 
which never changes is obviously zero. 

(iv) Geometrically, the derivative represents the gradient of the tangent at the point 

(x, y) to the curve whose equation is 

V = f(x). 
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(b) Differentiation from First Principles 

Ultimately, the derivatives of simple functions may be quoted from a table of standard 
results; but the establishing of such results requires the use of the definition of a derivative. 
We illustrate with two examples the process involved: 

EXAMPLES 

1. Differentiate the function x A from first principles. 

Solution 

Here we have a situation where the variable y is not mentioned; so, we could say “let 
y = x Av , and determine ^ from first principles in order to answer the question. 

However, we shall choose the alternative notation which does not require the use of y 
at all. 


d 

da; 



, (x + Sx) A — x A 
hm ? 

6x^0 OX 


Then, from Pascal’s Triangle (Unit 2.2), 


d 

da; 



x A + 4 x 3 Sx + 6x 2 (5x) 2 + 4a;(5a;) 3 + (Sx) A — x A 
5x^0 Sx 


= lim 

<5x— >0 L 


4a; 3 + 6 x 2 Sx + 4:x(5x) 2 + (5xY 


= 4a; 3 . 


Note: 

This result illustrates a general result which will not be proved here that 


da; 


\x = nx 


„n—l 


for any constant value n, not necessarily an integer. 


2. Differentiate the function sin a; from first principles. 

Solution 


— [sin x] 
ax 


lim 


sin(a; + Sx) — sin a; 
Sx 
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which, from Trigonometric Identities (Unit 3.5), becomes 


— [sin x\ 
da; 


2 cos ( x + ^ ) sin 
lim V 2 ’ 

5x — >0 OX 


= lim cos 

&r— >0 




Finally, using the standard limit (Unit 10.1), 

sin a; 

Inn = 1, 

x—>0 X 

we conclude that 


— [sin a;] = cos a;, 
da; 


Note: 

The derivative of cos a; may be obtained in the same way (see EXERCISES 10.2.5, 
question 2) but it will also be possible to obtain this later (Unit 10.3) by regarding 
cos a; as sin (f — a;). 

3. Differentiate from first principles the function 


log&a; 


where b is any base of logarithms. 

Solution 


d 

da; 


[log&z] 


Um l°gfe(a; + Sx) - log b x 
6x^0 Sx 


But writing 


we have 


= lim 

& r — >0 


i°& (i + g) 

Sx 


— — r that is Sx = rx, 
x 



1 lim lQgfe(1 + r) 
X r -+ 0 r 
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= -limlog 6 (l + r)r. 

X r -> 0 

For convenience, we may choose a base of logarithms which causes the limiting value 
above to equal 1; and this will occur when 

b = lim (1 + r)r . 

r — >0 


The appropriate value of b turns out to be approximately 2.71828 and this is the stan- 
dard base of natural logarithms denoted by e. 


Hence, 



1 

x 


Note: 

In scientific work, the natural logarithm of x is usually denoted by In x and this notation 
will be used in future. 

10.2.5 EXERCISES 

1. Differentiate from first principles the function a; 3 + 2. 

2. Differentiate from first principles the function cos a;. 

3. Differentiate from first principles the function y/x. 

Hint: 

(Vx + Sx — y/x)(V X + 5x + y/x) = 5x. 


10.2.6 ANSWERS TO EXERCISES 


1. 3a: 2 . 

2. —sin ax 
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UNIT 10.3 - DIFFERENTIATION 3 


ELEMENTARY TECHNIQUES OF DIFFERENTIATION 
10.3.1 STANDARD DERIVATIVES 

In Unit 10.2, reference was made to the use of a table of standard derivatives and such a 
table can be found in the appendix at the end of this Unit. 

However, for the time being, a very short list of standard derivatives is all that is necessary 
since other derivatives may be developed from them using techniques to be discussed later 
in this and subsequent Units. 


f(x) 

f'(x) 

a const. 
x n 
sin a; 
cos a: 

In a; 

0 

77 — 1 

nx 
cosx 
— sin x 

i 

—X 


Note: 

In the work which now follows, standard derivatives may be used which have not, here, been 
obtained from first principles; but the student is expected to be able to quote results from a 
table of derivatives including those for which no proof has been given. 

10.3.2 RULES OF DIFFERENTIATION 

(a) Linearity 

Suppose f(x) and g{x) are two functions of x while A and B are constants. Then 

U lAf(x) + Bg(x)] = ^T (/W ] + B±{ g (x)]. 


Proof: 

The left-hand side is equivalent to 

[A f{x + Sx) + Bg{x + &c)] — [Af(x) + Bg(x)\ 
5x^0 Sx 


= A 


lim 

>0 


fjx + Sx) - fjx) 
Sx 


+ B 


lim g(x + Sx) - gjx) 
Sx 
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The result, so far, deals with a “linear combination” of two functions of x but is easily 
extended to linear combinations of three or more functions of x. 


EXAMPLES 


1. Write down the expression for ^ in the case when 


y = 6a: 2 + 2a: 6 + 13a: — 7. 


Solution 

Using the linearity property, the standard derivative of x n , and the derivative of a 
constant, we obtain 


d y 

da: 


eia: 2 ] + 2-^ [a: 6 ] + 13-^- [a: 1 ] 
da: da: da: 


d 

da: 


[7] 


= 12a: + 12a: 5 + 13. 


2. Write down the derivative with respect to x of the function 

5 

— — 4 sin x + 2 In ax 
x z 

Solution 


d 

da: 

d 

da: 


a: z 


4 sin x + 2 In x 


5a: 2 — 4 sin x + 2 In x 


o Zj 

= —10a: — 4 cos a: H — 

x 

-10 2 

= — 5 4 cos x H — . 

X 6 X 
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(b) Composite Functions (or Functions of a Function) 

(i) Functions of a Linear Function 

Expressions such as (5x + 2) 16 , sin (2a: + 3) and ln(7 — 4a;) may be called “functions of a 
linear function” and have the general form 

f(ax + b), 

where a and b are constants. The function f(x) would, of course, be the one obtained on 
replacing ax + b by a single x; hence, in the above illustrations, f(x) would be a; 16 , sin x and 
In x, respectively. 

Functions of a linear function may be differentiated as easily as f(x) itself on the strength 
of the following argument: 


Suppose we write 


y — f(u) where u = ax + b. 


Suppose, also, that a small increase of 5x in x gives rise to increases (positive or negative) 
of 5y in y and Su in u. Then: 


dy 

da; 


,. Sy 
hm — 

— >0 OX 


5y 5u 
hm — — . 
Ou ox 


Assuming that Sy and Su tend to zero as 5x tends to zero, we can say that 

dy Sy Su 

— = hm — x hm — . 

da; ou ox 


That is, 

dy dy dw 

da; d u da; 

This rule is called the “Function of a Function Rule” or “Composite Function Rule” 
or “Chain Rule” and has applications to a much wider class of composite functions than 
has so far been discussed. But, for the moment we restrict the discussion to functions of a 
linear function. 
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EXAMPLES 


1. Determine ^ when y = (5a; + 2) 16 . 

Solution 

First, we write y = w 16 where u = 5x + 2. 
Then, ^ = 16m 15 and p = 5. 

7 dw drr 

Hence, g = 16w 15 .5 = 80(5a; + 2) 15 . 

2. Determine ^ when y = sin(2a; + 3). 

Solution 

First, we write y = sin-u where u = 2a; + 3. 
Then, (p = cosw and )p = 2. 

7 dn da? 

Hence, ^ = cosw.2 = 2cos(2a; + 3). 

3. Determine ^ when y = ln(7 — 4a;). 

Solution 

First, we write y = In u where u = 7 — 4x. 
Then, & = i and ^ = -4. 

Hence, £ = i.(-4) = 


Note: 

It is hoped that the student will quickly appreciate how the fastest way to obtain the deriva- 
tive of a function of a linear function is to treat the expression ax + b initially as if it were 
a single x ; then, multiply the final result by the constant value, a. 

(ii) Functions of a Function in general 


The formula 


d y d y d u 

da; d u da; 

is in no way dependent on the fact that the examples so far used to illustrate it have involved 
functions of a linear function. Exactly the same formula may be used for the composite 
function 
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f[g(x)\, 


whatever the functions f(x ) and g(x ) happen to be. All we need to do is to write 

y — f(u) where u = g(x), 

then apply the formula. 

EXAMPLES 


1. Determine an expression for ^ in the case when 


y = (x 2 + 7x — 3) 4 . 


Solution 

Letting y = u A where u = x 2 + 7x — 3, we have 


dy 

da; 


dy d u 
d u dx 


4u 3 . (2a; + 7) 


= 4(a; 2 + 7a; - 3) 3 (2a; + 7). 


2. Determine an expression for ^ in the case when 


y = ln(a; 2 — 3a; + 1). 


Solution 

Letting y — In u where u = x 2 — 3x + 1, we have 

dy = dy du = l _ 2a; -3 

da; du da; u v x 2 — 3x + 1 


3. Determine the value of ^ at x — 1 in the case when 

ax 

y = 2 sin(5a; 2 — 1) + 19a;. 


Solution 

Consider, first, the function 2sin(5a; 2 — 1) which we shall call z. 
Its derivative is 4^, where z = 2sin(5a; 2 — 1). 
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Let z — 2 sin u where u = 5a; 2 — 1; then, 


u.,4 u.,4 uu 2 \ 

— = — = 2 cos w. 10a; = 20a;cos(5a; — 1). 
da; dw da; 

Hence, the complete derivative is given by 

^ = 20a;cos(5a; 2 — 1) + 19. 
da; 

Finally, when x — 1, this derivative has the value 20 cos 4 + 19, which is approximately 
equal to 5.927, remembering that the calculator must be in radian mode. 


Note: 

Again, it is hoped that the student will appreciate how there is a faster way of differentiating 
composite functions in general. We simply treat g(x) initially as if it were a single x, then 
multiply by g'(x ) afterwards. 

For example, 


d 
da; L 


sin 3 x 


d r 

da; - 


(sin a;) 3 = 3(sina;) 2 . cos a; = 3sin 2 a;. cos a;. 


10.3.3 EXERCISES 

1. Determine an expression for ^ in the following cases: 
(a) 


(b) 


y = 3a; 3 — 8a; 2 + 11a; + 9; 


y = 10 cos x + 5 sin x — 14a; 7 ; 


y = (2x- 7) 5 ; 


(d) 


!l - (2 - 5.r) 3; 
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y = sin — x'j ; that is, cos a:; 


y = cos(4a: + 1); 


y = In (4 — 2x); 


V = In 


3a: — 8 
6a: + 2 


2. Determine an expression for ^ in the cases when 


V = ( 4 - 7a: 3 ) 8 ; 


y = (a: 2 + l) 5 ; 


y = cos x; 


y = ln(ln x). 

3. If y = sin(cosa:), evaluate ^ at x — |. 

4. If y — cos(7a: 5 — 3), evaluate at x — 1. 


10.3.4 ANSWERS TO EXERCISES 


1. (a) 


9ar — 16a: + 11; 


TO sin x + 5 cos x — 98ar 
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(c) 

(d) 

(e) 

(f) 

(g) 

(h) 

2 . (a) 

(b) 

(c) 

(d) 

3. 

4. 


10(2x — 7) 4 ; 

25 . 7 

y(2-5®)-*; 

— cos — x^j ; that is, — sinx; 

— 4sin(4x + 1); 

-2 2 

or ; 

4 — 2x 2x — 4 

3 6 54 

3x — 8 6x + 2 (3x — 8 ) ( 6 x + 2) ' 

— 168x 2 (4 — 7x 3 ) 7 ; 

3x(x 2 + 1)^; 

— 5cos 4 x. sinx; 

1 

x lnx 
-1 


-35 sin 4 = 26.488 



APPENDIX - A Table of Standard Derivatives 


f(x) 

fix) 

a (const.) 

0 

x n 

71 — 1 

nx 

sin ax 

a cos ax 

cos ax 

—a sin ax 

tan ax 

asec 2 ax 

cot ax 

— acosec 2 ax 

sec ax 

asec ax. tan ax 

cosec ax 

— acosec ax. cot ax 

111 X 

1/x 

gCLX 

ae ax 

a x 

a x . In a 

sinh ax 

a cosh ax 

cosh ax 

a sinh ax 

tanh ax 

asecth ax 

sech ax 

— asech ax. tanh ax 

cosech ax 

— acosech ax. cothx 

ln(sinx) 

cot X 

In (cos x) 

— tan x 

In (sinh x) 

coth x 

ln(coshx) 

tanh x 

sin _1 (x/a) 

1 /Via 2 - x 2 ) 

cos _1 (x/a) 

-l/v/(a 2 - x 2 ) 

tan _1 (x/a) 

a/(a 2 + x 2 ) 

sinh _i (x/a) 

l/# 2 + a 2 ) 

cosh _1 (x/a) 

1 /v/(x 2 -a 2 ) 

tanh _1 (x/a) 

a/ (a 2 — x 2 ) 
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UNIT 10.4 - DIFFERENTIATION 4 


PRODUCTS, QUOTIENTS AND LOGARITHMIC DIFFERENTIATION 
10.4.1 PRODUCTS 

Suppose 

y = u(x)v(x), 

where u(x) and v(x) are two functions of x. 

Suppose, also, that a small increase of Sx in x gives rise to increases (positive or negative) 
of Su in u, Sv in v and 5y in y. 

Then, 

d y (u + 5u) (v + Sv) — uv 

— = Inn 

da; Sx^o ox 


uv + uSv + vSu + SuSv — uv 
= hm 

5x^0 OX 


lim 

>0 


Sv Su 
+ “fa 


Hence, 

d du d u 

— \u.v\ = u - — b V — . 
ax ax ax 


Hint: Think of this as 

(FIRST x DERIVATIVE OF SECOND) + (SECOND x DERIVATIVE OF FIRST) 
EXAMPLES 

1. Determine an expression for (p in the case when 

y = x 7 cos 3a;. 


Solution 


— = x 7 . — 3 sin 3a; + cos 3a;. 7a; 6 = a; 6 [7 cos 3a; — 3a; sin 3x1 . 
da; 
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2. Evaluate 4^ at x = —1 in the case when 

ax 

y = (x 2 — 8) In (2 a; + 3). 

Solution 

— 8 

hxln(2x + 3) . 

2x + 3 V _ 

When x = —1, this has value —14 since In 1 = 0. 


= (a; 2 — 8). — - — .2 + ln(2a; + 3). 2x = 2 
clx 2x 3 


10.4.2 QUOTIENTS 

Suppose, this time, that 


u(x) 

v(x) 


Then, we may write 


y — u(x).[v(x)] 1 

in order to use the rule already known for products. 


We obtain 


which can be rewritten as 


dy 

da; 


«■(-!) M 


2 du 

da; 


+ v 


-l 


d u 

da;’ 


d \u 
da; Lu. 


ax ax 


EXAMPLES 


1. Using the formula for the derivative of a quotient, show that the derivative with respect 
to x of the function tana; is the function sec 2 a;. 

Solution 


— [tan x] 

da; 


d r sin x ' 

da; Lcosa;. 


(cos a;), (cos a;) — (sin x). (— sin x) 
cos 2 x 


cos 2 a; + sin 2 a; 
cos 2 a; 


cos 2 a; 


= sec x. 
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2. Determine an expression for ^ in the case when 

2x + 1 

V ~ (bx-3) 3 ' 


Solution 

Using u(x) = 2x + 1 and n(x) = (5a; — 3) 3 , we have 

d y _ (bx - 3) 3 .2 - (2x + l).3(5x - 3) 2 .5 
da; (5a; — 3) 6 

The expression (5a; — 3) 2 may be cancelled as a common factor of both numerator and 
denominator, leaving 


dy _ (5a; — 3). 2 — 15(2a; + 1) 20a; + 21 

da; “ (5a; - 3) 4 “ _ (5a;-3) 4 ‘ 


Note: 

The step in the second example above, where a common factor could be cancelled, may 
be avoided if we use a modified version of the rule for quotients when the function can be 
considered in the form 

u 

v 11 ' 

It can be shown that, if 
then, 


For instance, in Example 2 above, we could write 

u = 2x + 1 v = bx — 3 and n — 3 

Hence, 

dy _ (bx — 3). 2 — 3(2a; + 1).5 
da; (5a; — 3) 4 

as before. 


V = 


u 


d V ^ - rmf 

da; v n+l 


dv 
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10.4.3 LOGARITHMIC DIFFERENTIATION 

The algebraic properties of natural logarithms (see Unit 1.4), together with the standard 
derivative of In x and the rules of differentiation, enable us to differentiate two specific kinds 
of function as described below: 

(a) Functions containing a variable index 

The most familiar function with which to introduce this technique is the 

“exponential function”, e x . 

Suppose we let 

y = e x ; 

then, by properties of natural logarithms, we can write 

In y = X] 

and, if we differentiate both sides with respect to x, we obtain 


That is, 


Hence, 


Notes: 


Id y 

y da; 


d y 

— = y = e . 

ax 



(i) After taking logarithms, we could have differentiated the statement x = In y with respect 
to y, obtaining 

da; 1 

d y y' 


But it can be shown that, for most functions, 

d V_ = 

da; (b 

so that the same result is obtained as before. 
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(ii) The derivative of e x may easily be used to establish the standard derivatives of the 
hyperbolic functions, sinhrc, cosh re and tanh x as follows: 


f [sinlirr] = coshrr, f [coshrr] = sinh x, f [tanhx] = sech 2 x. 
dre dre dre 


The hrst two of these follow from the definitions 


e x + e x 


sinh x = 


and coshrc = 


while the third may be obtained using the definition 


sinh x 

tanh x = , 

cosh x 


together with the Quotient Rule. 

FURTHER EXAMPLES 

1. Write down the derivative with respect to x of the function 

gSinx 


Solution 

All that is required in this example is the standard derivative of e x together with the 
Function of a Function Rule. We obtain 


d 

dre 


gSintc 


gSinx 


cosx. 


2. Determine an expression for 


-r in the case when 


y = (3x + 2) x . 


Solution 

Taking natural logarithms of both sides, 

hay — x ln(3rc + 2). 
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Differentiating both sides with respect to x and using the Product Rule gives 


Id y _ x 3 
y dx 3x + 2 


+ ln(3a; + 2).l. 


Hence, 


d y 

da; 


(3a; + 2) x 


3x 

_3x + 2 


+ ln(3a; + 2) . 


(b) Products or Quotients with more than two elements 

We have already discussed the rules for differentiating products and quotients; but, in cer- 
tain cases, it is easier to make use of logarithmic differentiation. Essentially, we use this 
alternative method when a product or a quotient involves more than the two functions u(x ) 
and v(x) mentioned earlier. 

We illustrate with examples: 

EXAMPLES 


1. Determine an expression for ^ in the case when 

y = e x2 . cosx.(x + l) 5 . 


Solution 

Taking natural logarithms of both sides, 

lny = x 2 * + ln(cosa;) + 51n(a; + 1). 


Differentiating both sides with respect to x, 

1 d y sin x 5 

-~r- = 2x 1 -. 

y da; cos x x + 1 


Hence, 


cl y i \ 5 

— — e . cosx.(x + 1) 
da; 


2x — tan x H . 

x + 1. 


2. Determine an expression for 


A in the case when 


e x . sin a; 

(7x + l) 4 ' 
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Solution 

Taking natural logarithms of both sides, 

In y — x + ln(sin x) — 4 ln(7a; + 1). 

Differentiating both sides with respect to x, 

1 dy cos x . 7 

y da; sin x 7x + 1 

Hence, 

28 ' 

7x + 1. 

Note: 

In all examples on logarithmic differentiation, the original function will appear as a factor 
at the beginning of its derivative. 

10.4.4 EXERCISES 


dy e x . sin x \ 

7 — r 1 + COt X 

da; (7x + l) 4 


1. Differentiate the following functions with respect to x: 
(a) 


(b) 


(d) 


sin a;, cos a;; 
(x 2 + 3). sin 2a;; 
x.(x 2 + 1)5; 
x 2 ln(l — 2x). 


2. Differentiate the following functions with respect to x: 
(a) 


(that is, cot a;); 

sm a; 
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(b) 


x 2 -2 _ 
(x + l) 2 ’ 


cos x + sin x 
cos x — sin x 


(d) 


x 


(2x — x 2 ) 2 

3. Differentiate the following functions with respect to x: 


(b) 


0 X 2 + 1. 


A—x—x z 


(2x + l)e 


4-x 6 . 


(d) 


d 1—7x 


3a; + 2’ 


x. sinh(x 2 + 1); 


(f) 


4. 


sech x. 

Use logarithms to differentiate the following functions with respect to x: 
(a) 


a x (a constant); 



(b) 

(x 2 + l) 3 *; 

(c) 

(smx) x ] 

(d) 

x(x — 2) 

(x + l)(x + 3) ’ 

(e) 

e 2x . In x 


1 

co 

10.4.5 ANSWERS TO EXERCISES 

1. (a) 

cos 2 x — sin 2 x (or cos 2x) 

(b) 

2(x 2 + 3) cos 2x + 2x sin 2x\ 

(c) 

2x 2 + 1 _ 
(x 2 + l) 1 2 , 

(d) 

2x 2 

2xln{l 2x) 1 _ 2a _. 

2. (a) 

— cosec 2 x; 

(b) 

4 + 2x 
(x + 1) 3 ’ 


9 



(c) 


2 

cos a; — sin re) 2 ’ 


(d) 

3. (a) 

(b) 

(c) 

(d) 

(e) 

(f) 

4. (a) 

(b) 

(c) 

(d) 

(e) 


3 • 

(2re — re 2 ) 2 
2rce a ’ 2+1 ; 

— (1 + 2x)e 1 ~ x ~ x2 ] 

2.e A ~ x3 — 3rc 2 (2rc + l)e 4_a;3 ; 

e 1 ~ 7x .(21x + IT) _ 

(3rc + 2) 2 ’ 

sinh(rc 2 + 1) + 2rc 2 cosh(re 2 + 1); 
— cosech 2 re. 
a x . In a; 

(re 2 + l) 3 * [ 3 ln(re 2 + l) + 

OC | X 


(sin x) x [In sin re + re cot re] ; 

rc(rc — 2) rl 1 _ 1 _ 1 ' 

(re + 1) (re + 3) .re re — 2 re + 1 re + 3. 

e 2x .lnre f 1 3 1 

2 4 . 

(re — l) 3 L re In re re — 1. 


10 
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UNIT 10.5 - DIFFERENTIATION 5 
IMPLICIT AND PARAMETRIC FUNCTIONS 

10.5.1 IMPLICIT FUNCTIONS 

Some relationships between two variables x and y do not give y explicitly in terms of x (or 
x explicitly in terms of y); but, nevertheless, it is implied that one of the two variables is 
a function of the other. In the work which follows, we shall normally assume that y is a 
function of x. 

Consider, for instance, the relationship 

x 2 + y 2 = 16, 

which is not explicit for either x or y but could, if desired, be written in one of the two forms 

y = ±\/l6 — x 2 or x = 16 — y 2 . 

By contrast, consider the relationship 

x 2 y 3 + 9 sin(5x — 7 y) = 10. 

In this case, there is no apparent way of stating either variable explicitly in terms of the 
other; yet we may still wish to calculate or even 

Such relationships between x and y are said to be “implicit relationships” and, in the 
technique of “implicit differentiation” , we simply differentiate each term in the relation- 
ship with respect to the same variable without attempting to rearrange the formula. 

EXAMPLES 

1. Determine an expression for (p in the case when 

x 2 + y 2 = 16. 


Solution 

Treating y 2 as a function of a function, we have 

2a; + 2 y^ = 0. 

da; 


Hence, 


dy —2a; x 
da; 2 y y 


1 



It is perfectly acceptable that the result is expressed in terms of both x and y; this will 
normally happen. 

2. Determine an expression for ^ in the case when 

x 2 + 2 xy 3 + y 5 = 4. 


Solution 

Treating y 3 and y 5 as functions of a function and using the Product Rule in the second 
term on the left hand side, 


2x + 2 


x.3 y 2 


dy 

da; 


+ y 3 . i 


+ 5y 4 


dy 

da; 


0 . 


On rearrangement, 


Hence, 


6 xy 2 + 5 y 4 


dy 

da; 


— (2a; + 2y 3 ). 


dy 2x + 2 y 3 
da; Qxy 2 + 5y 4 


3. Determine an expression for ^ in the case when 


x 2 y 3 + 9 sin(5a; — 7 y) = 10. 


Solution 

Differentiating throughout with respect to x and using both the Product Rule and the 
Function of a Function Rule, we obtain 


a; 2 . 3 y 2< -^ + y 3 . 2a; + 9 cos(5a; — 7y). 


5-7 


dy 

da; 


= 0 . 


On rearr augment, 


3 x 2 y 2 — 63 cos(5a; 



2 xy 3 + 45 cos(5a; — 7 y) . 


Thus, 


dy 2 xy 3 + 45 cos(5a; — 7 y) 

da; 3a; 2 y 2 — 63 cos(5a; — 7y) 


2 



10.5.2 PARAMETRIC FUNCTIONS 


In the geometry of straight lines, circles etc, we encounter “parametric equations” in 
which the variables x and y, related to each other by a formula, may each be expressed 
individually in terms of a third variable, usually t or 0, called a “parameter”. 


In general, we write 


x = x(t) and y = y(t ); 


but, in theory, we can imagine that t could be expressed explicitly in terms of x ; so, essen- 
tially, y is a function of t, where t is a function of x. Hence, from the Function of a Function 
Rule, 


d y d y d t 

da; dt da; 

However, we are not given t explicitly in terms of x and it may not be practical to obtain it 
in this form. Therefore, we write 


d y d y Ax 

da; dt dt 

This is the standard formula for differentiating y with respect to x from a pair of parametric 
equations. 

EXAMPLES 

1. Determine an expression for ^ 

Solution 

Hence, 


in terms of t in the case when 


x = 3 t 2 and y = 6 1. 


dy , da; 

— = 6 and — = 6 1. 
dt dt 


dy 6 1 

da; 6 1 t 


3 



2. Determine an expression for ^ in terms of # in the case when 

x = sin 3 # and y = cos 3 #. 


Solution 


Hence, 


That is, 


— = 3sin 2 #. cos # and -rj = — 3cos 2 #. sin #. 
d# d# 


d y — 3cos 2 #. sin # 

da; 3sin 2 #. cos # 


dy 

da; 


cos # 
sin# 


— cot #. 


10.5.3 EXERCISES 


1. Determine an expression for ^ in the following cases: 
fa) 


(b) 


x 2 + y 2 = 10a;; 


x 3 + y 3 — 3 xy 2 = 8; 


(d) 


x 4 + 2 x 2 y 2 + y 4 = x; 


xe y = cos y. 


2. Determine an expression for ^ in terms of the appropriate parameter in the following 
cases: 


(b) 


x = 3 sin # and y = 4 cos #; 


x = At and y 


4 

V 


4 



(c) 

x—{l — t)5 and y — (1 — t 2 ) 5. 


10.5.4 ANSWERS TO EXERCISES 


1. (a) 


(b) 


(c) 


(d) 


2. (a) 


(b) 


(c) 


d y 5 — x 

dx y 


d y y 2 — x 2 

da; y 2 — 2 xy ’ 


d y 1 — 4a; 3 — 4 xy 2 
dx 4 (x 2 y + y 3 ) 


dy e y 

da; xe y + sin y 


dy 

da; 


— - tan#; 
o 


d y = _ 

da; t 2 ’ 


dy 2 1 
da; (l + t)5 - 


5 
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UNIT 10.6 - DIFFERENTIATION 6 

DERIVATIVES OF INVERSE TRIGONOMETRIC FUNCTIONS 
10.6.1 SUMMARY OF RESULTS 

The derivatives of inverse trigonometric functions should be considered as standard results. 
They will be stated here first, before their proofs are discussed. 


1 . 


where — | < sin l x < |. 

2 . 


where 0 < cos 1 x < 7 r. 

3 . 




u. r — 1 i 

— tan x\ 
da; 


1 

1 + x 2 ’ 


where -f < tan -1 ® < f . 


10.6.2 THE DERIVATIVE OF AN INVERSE SINE 


We shall consider the formula 
and determine an expression for 


y = Sin 1 x 


Note: 

There is a special significance in using the upper-case S in the formula; the reason will be 
explained later. 


The formula is equivalent to 

x = sin y, 

so we may say that 

da; 

— = cos y = ± 
d y 


1 — sin 2 ?/ = ±\/l — x 2 . 


1 



Thus, 


d V _ 1 

da; a/1 - x 2 ' 

Consider now the graph of the formula 


y = Sin l x , 

which may be obtained from the graph of y = sin a; by reversing the roles of x and y and 
rearranging the new axes into the usual positions. We obtain; 



Observations 


(i) The variable x must lie in the interval — 1 < x < 1; 

(ii) For each value of x in the interval — 1 < x < 1, the variable y has infinitely many values 
which are spaced at regular intervals of |. 

(iii) For each value of x in the interval — 1 < x < 1, there are only two possible values of 
one of which is positive and the other negative. 

(iv) By restricting the discussion to the part of the graph from y — — | to ?/ = |, there 
will be only one value of y and one (positive) value of (p for each value of x in the interval 
-1< x <1. 


The restricted part of the graph defines what is called the “principal value” of the inverse 
sine function and is denoted by sin _1 a; using a lower-case s. 


Hence, 


u r ■ — i i 
— Sill x\ 

da; 


1 


a/1 — x 2 


2 



10.6.3 THE DERIVATIVE OF AN INVERSE COSINE 


We shall consider the formula 
and determine an expression for 


y = Cos 1 x 


Note: 

There is a special significance in using the upper-case C in the formula; the reason will be 
explained later. 


The formula is equivalent to 

x = cos y, 

so we may say that 

da; 

— = — sin y = ± 
d y 


'1 — cos 2 y = ±v 1 — x 2 . 


Thus, 



Consider now the graph of the formula 

y = Cos _1 a; 

which may be obtained from the graph of y = cos a; by reversing the roles of x and y and 
rearranging the new axes into the usual positions. We obtain: 



3 



Observations 


(i) The variable x must lie in the interval — 1 < x < 1; 

(ii) For each value of x in the interval — 1 < x < 1, the variable y has infinitely many values 
which are spaced at regular intervals of |. 

(iii) For each value of x in the interval — 1 < x < 1, there are only two possible values of 
one of which is positive and the other negative. 

(iv) By restricting the discussion to the part of the graph from y = 0 to y = n, we may 
distinguish the results from those of the inverse sine function; and there will be only one 
value of y with one (negative) value of (p for each value of x in the interval — 1 < x < 1. 

The restricted part of the graph defines what is called the “principal value” of the inverse 
cosine function and is denoted by cos _1 a; using a lower-case c. 


Hence, 


u r — i i 

— cos x\ 

da: 


1 


y/l — x 2 


10.6.4 THE DERIVATIVE OF AN INVERSE TANGENT 


We shall consider the formula 


and determine an expression for 


y = Tan l x 


Note: 

There is a special significance in using the upper-case T in the formula; the reason will be 
explained later. 


The formula is equivalent to 


x = tan y, 


so we may say that 


— = sec 2 y = 1 + ta n 2 y = 1 + x 2 . 
d y 


Thus, 


dy = 1 

da; 1 + x 2 


4 



Consider now the graph of the formula 

y = Tan _1 a;, 

which may be obtained from the graph of y — tana; by reversing the roles of x and y and 
rearranging the new axes into the usual positions. We obtain; 


y 



Observations 

(i) The variable x may lie anywhere in the interval — oo < x < oo; 

(ii) For each value of x, the variable y has infinitely many values which are spaced at regular 
intervals of n. 

(iii) For each value of x , there is only possible value of which is positive. 

(iv) By restricting the discussion to the part of the graph from y = — | to ?/ = |, there will 
be only one value of y for each value of x. 

The restricted part of the graph defines what is called the “principal value” of the inverse 
tangent function and is denoted by tan _1 a; using a lower-case t. 

Hence, 


Ll _ 1 . 

— tan x\ 
da; 


1 

1 + x 2 ' 


ILLUSTRATIONS 

1 . 


2 . 


d _l n 
— sm 2a; 
da; 


2 

a/1 — 4a; 2 



(i + 3)] 


1 

\A-(i + 3) 2 ' 


5 



3 . 


d 

da; 


[tan 1 (sina;) 


cos a; 

1 + sin 2 a; 


da; 


[sin 1 (a; 5 )] 


5a; 4 




x 


10 


(real only if — 1 < x < 1). 


10.6.5 EXERCISES 


1. Determine an expression for ^ in the following cases, assuming any necessary restric- 
tions on the values of x: 

(al 


(b) 


y = cos x 7x] 


y = tan ^(cosa;); 


y = sin x (3 — 2x). 


2. Show that 

3. If 

show that 
(a) 


d 

da; 


tan 


_ 1 fl + tana; 
1 — tan x 


= 1 . 


sin l x 


V = 


vT 


x z 


(b) 




(1 — — xy + 1', 

ax 




6 



10.6.6 ANSWERS TO EXERCISES 


1. (a) 


(b) 


7 

Vl -49a; 2 


1 1 

real only if — - < x < - 

7 7 


sin x 

1 + cos 2 a; ’ 


2 

V'l 4'(3-2jy 


(real only if 1 < x < 2). 


7 
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UNIT 10.7 - DIFFERENTIATION 


DERIVATIVES OF INVERSE HYPERBOLIC FUNCTIONS 
10.7.1 SUMMARY OF RESULTS 

The derivatives of inverse trigonometric and inverse hyperbolic functions should be consid- 
ered as standard results. They will be stated here, first, before their proofs are discussed. 


HS [sillh " lxl = Trb?' 


where — oo < sinh 1 x < oo 


llcosh-'x] = , 


where cosh l x > 0. 


3 . 


^Hanh-U] = -±- 2 , 


where — oo < tanh l x < oo. 


10.7.2 THE DERIVATIVE OF AN INVERSE HYPERBOLIC SINE 


We shall consider the formula 


and determine an expression for 


y = Sinh 1 a: 


Note: 

The use of the upper-case S in the formula is temporary; and the reason will be explained 
shortly. 


The formula is equivalent to 
so we may say that 

da; 

d y 


x = sinhy, 


cosh y = \jl-\- sin 2 y 


Vl + x 2 , 


1 



noting that cosh y is never negative. 
Thus, 



Consider now the graph of the formula 

y = Sinh - 'a: 

which may be obtained from the graph of y — sinh x by reversing the roles of x and y and 
rearranging the new axes into the usual positions. We obtain: 



Observations 


(i) The variable x may lie anywhere in the interval —oo<x<oo. 

(ii) For each value of x, the variable y has only one value. 

(iii) For each value of x , there is only one possible value of which is positive. 

(iv) In this case (unlike the case of an inverse sine, in Unit 10.6) there is no need to distinguish 
between a general value and a principal value of the inverse hyperbolic sine function. This 
is because there is only one value of both the function and its derivative. 

However, it is customary to denote the inverse function by sinh _1 x, using a lower-case s 
rather than an upper-case S. 


Hence, 


— [sinh 1 x] 
da; 



2 



10.7.3 THE DERIVATIVE OF AN INVERSE HYPERBOLIC COSINE 


We shall consider the formula 


and determine an expression for 


y = Cosh l x 


Note: 

There is a special significance in using the upper-case C in the formula; the reason will be 
explained shortly. 


The formula is equivalent to 
so we may say that 

da; 
d y 


x = coshy, 


sinhy = ±v cosh y — 1 = 


±\/x 2 — 1. 


Thus, 


dy = 1 

da; \Jx 2 - 1 ’ 


Consider now the graph of the formula 


y = Cosh x a;, 


which may be obtained from the graph of y = cosh a; by reversing the roles of x and y and 
rearranging the new axes into the usual positions. We obtain: 


y 


0 



3 



Observations 


(i) The variable x must lie in the interval x > 1. 

(ii) For each value of x in the interval x > 1, the variable y has two values one of which is 
positive and the other negative. 

(iii) For each value of x in the interval x > 1, there are only two possible values of one 
of which is postive and the other negative. 

(iv) By restricting the discussion to the part of the graph for which y > 0, there will be only 
one value of y with one (positive) value of ^ for each value of x in the interval x > 1. 

The restricted part of the graph defines what is called the “principal value” of the inverse 
cosine function and is denoted by cosh _1 a;, using a lower-case c. 


Hence, 


— [cosh 1 x] 
da; 


1 



10.7.4 THE DERIVATIVE OF AN INVERSE HYPERBOLIC TANGENT 


We shall consider the formula 
and determine an expression for p. 


y = Tanh l x 


Note: 

The use of the upper-case T in the formula is temporary; and the reason will be explained 
later. 


The formula is equivalent to 
so we may say that 

da; 
d y 


x = tanh y, 


sech 2 y = 1 — tanh 2 y = 1 — x 2 . 


Thus, 


d y = i 

da; 1 — x 2 


Consider now the graph of the formula 

y = Tanh -1 a;, 


4 



which may be obtained from the graph of y — tanh x by reversing the roles of x and y and 
rearranging the new axes into the usual positions. We obtain: 



Observations 


(i) The variable x must lie in the interval — 1 < x < 1. 

(ii) For each value of x in the interval — 1 < x < 1, the variable y has just one value. 

(iii) For each value of x in the interval —1 < x < 1, there is only possible value of which 
is positive. 

(iv) In this case (unlike the case of an inverse tangent in Unit 10.6) there is no need to 
distinguish between a general value and a principal value of the inverse hyperbolic tangent 
function. This is because there is only one value of both the function and its derivative. 

However, it is customary to denote the inverse hyperbolic tangent by tan~ 1 x using a lower- 
case t rather than an upper-case T. 


Hence, 


— [tanh l x] 

da: 


1 

1 — x 2 ' 


ILLUSTRATIONS 


1 . 



1 (tanh a;) 


seclra; 


\Jl- tanh 2 a; 


sechax 


5 



2 . 


5 


— [cosh 1 (5a; 
da; 


4 )] 


(5a; — 4) 2 - l’ 


assuming that 5a; — 4 > 1; that is, x > 1. 


10.7.5 EXERCISES 

Obtain an expression for ^ in the following cases: 

1 . 


y = cosh 1 (4 + 3a;), 

assuming that 4 + 3a; > 1; that is, x > —1. 

2 . 


y = cos 1 (sinha;), 


assuming that — 1 < sinha; < 1. 
3. 


y = tanh 1 (cosa;). 


4. 


y = cosh 1 (a; 3 ), 


assuming that x > 1. 
5. 


, _i 2a; 

y = tanh 

y 1 + a; 2 


6 



10.7.6 ANSWERS TO EXERCISES 


1 . 


2 . 


3. 


3 

•^(4 + 3a;) 2 - l' 


cosh x 


1 — sinhx 


— coseca;. 


4. 


5. 


3a; 2 

\/a; 6 — 1 


2 

1 — x 2 ’ 


7 
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UNIT 10.8 - DIFFERENTIATION 8 


HIGHER DERIVATIVES 
10.8.1 THE THEORY 

In most of the examples (seen in earlier Units) on differentiating a function of x with respect 
to x, the result obtained has been another function of x. In general, this will be the case 
and the possibilty arises of differentiating again with respect to x. 

This would occur, for example, in the case when the formula 

V = f(x) 

represents the distance, y, travelled by a moving object at time, x. 

The speed of the moving object is the rate of increase of distance with respect to time; that 
is, But a second quantity called acceleration is defined as the rate of increase of speed 
with respect to time. It is therefore represented by the symbol 

d d y 
dx dx 


but this is usually written as 

d 2 y 

dx 2 

and is pronounced “d two y by da; squared” . 

We could, if necessary, differentiate over and over again to obtain the derivatives of order 
three, four, etc., namely 


d 3 y d 4 y 

dad and TU’ 


etc. 


EXAMPLES 


1. If y = sin 2a;, show that 


d 2 y 
da; 2 


+ 4y 


0 . 


1 



Solution 

Firstly, 

dy 0 0 

— = 2 cos 2x, 
ax 

so that, on differentiating a second time, we obtain 

d 2 y 

— — = —4 sm 2a; = —4 y. 
dx 2 y 

Hence, the result follows. 

2. If y = a; 4 , show that every derivative of y with respect to x after the fourth derivative 
is zero. 

Solution 


^ = 4x 3 - 
dx 

= 12a: 2 - 

dx 2 

d 3 y 

^-3 = 24a d 
da;- 5 

d 4 y 

da; 4 


= 24. 


We now have a constant function, so that all future derivatives will be zero. 
Note: 

In general, every derivative of y = x n after the n-th derivative will be zero. 

3. If x = 3 f 2 and y = 6 1, obtain an expression for in terms of t. 

Solution 


Arr dx ’ 

UX d t 


giving 

dy 6 1 

da; 6 1 t ' 

In order to differentiate again with respect to x , we observe that, in the formula for the 
first derivative with respect to x, we need to replace y with 

That is, 


d 2 y d 

dy 

d 

d y 
dx 

da; 2 da; 

da; 

da; 


2 



Hence, 


d 2 y 
dx 2 


A Al 

d t dx 


drr 

d t 


In the present example, therefore, we obtain 


d 2 y 
dx 2 


A I 

d t t 


6 1 



i 

6 1 3 


Note: 

For a function f(x), an alternative notation for the derivatives of order two, three, four, etc. 

is 


/"0), f {lv \x), etc. 


10.8.2 EXERCISES 

1. Obtain expressions for j in the following cases: 

(a) 

y = 4r 3 — 7x 2 + 5x — 17; 


(b) 


V = (3x - 2) 10 ; 


(d) 


V = x 2 e 4x ] 


y = 


x — 1 

X + 1 ’ 


2. If y = sin 3a;, evaluate ‘[j when x = |. 

3. If x 2 + y 2 — 2x + 2y = 23 determine the values of ^ and at the point where x = —2 
and y = 3. 

4. If x = 3(1 — cos 6) and y = 3 (6 — sind), show that 


3 



(a) 

(b) 


dy 

da; 



d 2 y 1 

da; 2 12 sin | cos 3 1 


5. If y 


3e 2x cos(2a; — 3), verify that 


d 2 y 

dx 2 



+ 8 y 


0 . 


10.8.3 ANSWERS TO EXERCISES 


1. (a) 

(b) 


(d) 


2 . 


3. 


d 2 y 
da; 2 


12a; 2 - 14; 


d 2 y_ 

da; 2 


810(3a; — 2) 8 ; 


d 2 y = 

dx 2 


16a; 2 + 16a; + 2 


d 2 y 4 

da; 2 ( x + l 3 


9 

71 ' 


3 

4 


and — 


25 

64" 


4 
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UNIT 11.1 - APPLICATIONS OF DIFFERENTIATION 1 


TANGENTS AND NORMALS 
11.1.1 TANGENTS 

In the definition of a derivative (Unit 10.2), it is explained that the derivative of the function 
f(x ) can be interpreted as the gradient of the tangent to the curve y = f(x ) at the point 
(x,y). 

We may now use this information, together with the geometry of the straight line, in order 
to determine the equation of the tangent to a given curve at a particular point on it. 

We illustrate with examples which will then be used also in the subsequent paragraph dealing 
with normals. 

EXAMPLES 

1. Determine the equation of the tangent at the point (—1, 2) to the curve whose equation 

is 

y = 2x 3 + 5a; 2 — 2x — 3. 


Solution 


— = 6a; 2 + 10a; — 2, 
da; 


which takes the value —6 when x — — 1. 

Hence the tangent is the straight line passing through the point (—1, 2) having gradient 
—6. Its equation is therefore 

y — 2 = — 6(a; + 1). 


That is, 


6a; + y + 4 = 0. 


2. Determine the equation of the tangent at the point (2, —2) to the curve to the curve 
whose equation is 


x 2 + y 2 + 3 xy + 4 = 0. 


Solution 


2a; + 2y < Y~ + 3 
da; 


dy 

x-f- + y 
dx 


0 . 


That is, 


1 



d y 2x + 3 y 

dx 3x + 2y ’ 

which takes the value —2 at the point (2, —2). 

Hence, the equation of the tangent is 

y + 2 = —2(x — 2). 


That is, 


2x + y — 2 = 0. 

3. Determine the equation of the tangent at the point where t — 2 to the curve given 
parametrically by 


x = 


3 1 

1 + t 


and y 


i 2 

1 + t 


Solution 

We note first that the point at which t — 2 has co-ordinates ^2, | 
Furthermore, 


da; 3 dy 2t + 1 2 

dt = (1 + t) 2 anC d t = (1 + t) 2 ’ 

by the quotient rule. 

Thus, 

dy 2 1 + t 2 

da; 3 

which takes the value | when t — 2. 

Hence, the equation of the tangent is 


That is, 


3y + 12 = 8a;. 


2 



11.1.2 NORMALS 


The normal to a curve at a point on it is defined to be a straight line passing through this 
point and perpendicular to the tangent there. 

Using previous work on perpendicular lines (Unit 5.2), if the gradient of the tangent is m, 
then the gradient of the normal will be — 

EXAMPLES 

In the examples of section 11.1.1, therefore, the normals to each curve at the point given 
will have equations as follows: 


1 . 


That is, 


That is, 


That is, 


V- 2 = -(a? H- 1). 
o 


6 y = x + 13. 


V + 2 = 2 ^ -2 )- 


2y = x — 6. 


4 3 

v -3 = s lx - 2) - 


24 y + 9a; = 50. 


Note: 

It may occasionally be required to determine the angle, 6 , between two curves at one of 
their points of intersection. This is defined to be the angle between the tangents at this 
point; and, if the gradients of the tangents are m i = ta ,n9i and m 2 = tand 2 , then the angle 
9 = 9 2 — 9 1 and is given by 
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That is, 


tan 9 


tan 02 — tan 0\ 
1 + tan 0 2 tan 9\ 



11.1.3 EXERCISES 

1. Determine the equations of the tangent and normal to the following curves at the point 
given: 

(a) 

8 y = x 2 3 at (2, 1); 


(b) 


9 r 

e cos x 

(1 + X ) 3 


at (0,1). 


2. The parametric equations of a curve are 

x — 1 + sin 2 1, y = 1 + cos t + cos 2 1. 

Determine the equation of the tangent to the curve at the point for which t = 


4 


to | 



3. Determine the quation of the tangent at the point (2, 3) to the curve whose equation is 

3a; 2 + 2 y 2 = 30. 


4. Determine the equation of the normal at the point (—1, 2) to the curve whose equation 

is 

2 xy + 3 xy 2 — x 2 + y 3 + 9 = 0. 

11.1.4 ANSWERS TO EXERCISES 


1. (a) The tangent is 

and the normal is 
(b) The tangent is 
and the normal is 

2. The tangent is 

3. The tangent is 

4. The normal is 


2y = 3x — 4, 
2x + 3y = 7; 
V = l-x, 
y — x + 1. 

2y — x — 1. 
x + y = 5. 
x + 9y = 17. 
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UNIT 11.2 - APPLICATIONS OF DIFFERENTIATION 2 


LOCAL MAXIMA, LOCAL MINIMA AND POINTS OF INFLEXION 
11.2.1 INTRODUCTION 

(a) Let us first suppose that the formula 

s = f(t) 

represents the distance s, travelled in time t, by a moving object from some previously chosen 
point on its journey. 

The derivative , ^|, of s with respect to t gives the speed of the object at time t and can 
be represented by the slope of the tangent at the point (t, s ) to the curve whose equation is 

s = /(*)■ 

For any instant, to, of time, at which the object is stationary, the value of the derivative will 
be zero and hence, the slope of the tangent will be zero. 

The corresponding point (£ 0 > s o), on the graph may thus be called a “stationary point”. 

(b) More generally, any relationship, 

V = f(x), 

between two variable quantities, x and y, can usually be represented by a graph of y against 
x and any point (To, Ijo) on the graph at which ^ takes the value zero is called a “stationary 
point”. The tangent to the curve at the point (x 0 ,y 0 ) will be parallel to x-axis. 



In the paragraphs which follow, we shall discuss the definitions and properties 
of particular kinds of stationary point. 
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11.2.2 LOCAL MAXIMA 


A stationary point (x 0 ,y 0 ) on the graph whose equation is 

V = f(x) 

is said to be a “local maximum” if y 0 is greater than the y co-ordinates of all other points 
on the curve in the immediate neighbourhood of (xqjZ/o)- 



Note: 

It may well happen that, for points on the curve which are some distance away from (a;o, yo), 
their y co-ordinates are greater than y 0 ; hence, the definition of a local maximum point must 
refer to the behaviour of y in the immediate neighbourhood of the point. 

11.2.3 LOCAL MINIMA 

A stationary point (xo,yo) on the graph whose equation is 

y = /( a ) 

is said to be a “local minimum” if y 0 is less than the y co-ordinates of all other points on 
the curve in the immediate neighbourhood of (x 0 ,|/o)- 
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Note: 

It may well happen that, for points on the curve which are some distance away from (a^o, Vo), 
their y co-ordinates are less than yo ; hence, the definition of a local minimum point must 
refer to the behaviour of y in the immediate neighbourhood of the point. 

11.2.4 POINTS OF INFLEXION 

A stationary point (x 0 ,y 0 ) on the graph whose equation is 

V = f(x) 

is said to be a “point of inflexion” if the curve exhibits a change in the direction bending 
there. 
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11.2.5 THE LOCATION OF STATIONARY POINTS AND THEIR NATURE 


In order to determine the location of any stationary points on the curve whose equation is 

y = }{x), 

we simply obtain an expression for the derivate of y with respect to x , then equate it to zero. 
That is, we solve the equation 

p- = 0. 

ax 

Having located a stationary point (x 0 ,y 0 ), we may then determine whether it is a local 
maximum, a local minimum, or a point of inflexion using two alternative methods. These 
methods will be illustrated by examples: 

METHOD 1. - The “First Derivative” Method 

Suppose e denotes a number which is relatively small compared with xq. 

If we examine the sign of (p , first at x = xq — e and then at x = xq + e, the following 
conclusions may be drawn: 

(a) If the sign of 4^ changes from positive to negative, there is a local maximum at (xo,Z/o)- 

(b) If the sign of ^ changes from negative to positive, there is a local minimum at (xo,Z/o)- 

(c) If the sign of ^ does not change, there is a point of inflexion at (x 0 ,y 0 ). 

EXAMPLES 

1. Determine the stationary point on the graph whose equation is 

y = 3 — x 2 . 


Solution: 


dy 

da; 


= —2a;, 


which is equal to zero at the point where x = 0 and hence, y 
If x — 0 — e, (for example, x = —0.01), then ^ > 0 and 
If x — 0 + e, (for example x = 0.01), then ^ < 0. 

Hence, there is a local maximum at the point (0,3). 


3. 
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2. Determine the stationary point on the graph whose equation is 

y = x 2 — 2x + 3. 

Solution: 

£ = 2*-2, 
da; 

which is equal to zero at the point where x — 1 and hence, y — 2. 
If x — 1 — e, (for example, a; = 1 — 0.01 = 0.99), then qf < 0 and 
If a; = 1 + e, (for example, x — 1 + 0.01) = 1.01, then ^ > 0. 
Hence, there is a local minimum at the point (1,2). 



3. Determine the stationary point on the graph whose equation is 

y = 5 + x 3 . 
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Solution: 


dy 

da; 


= 3a; 2 , 


which is equal to zero at the point where x = 0 and hence, y — 5. 
If x — 0 — e, (for example, x = —0.01), then ^ > 0 and 
If x — 0 + e, (for example, x = 0.01), then ^ > 0. 

Hence, there is a point of inflexion at (0, 5). 



METHOD 2. - The “Second Derivative” Method 

This method considers the general appearance of the graph of (p against x, which is called 
the “first derived curve” . The properties of the first derived curve in the neighbourhood 
of a stationary point ( x 0 ,y 0 ) may be used to predict the nature of this point. 
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(a) Local Maxima 



As x passes from values below x 0 to values above x 0 , the corresponding values of 4^ steadily 
decrease from large positive values to large negative values, passing through zero when 
x = x 0 . 

This suggests that the first derived curve exhibits a “going downwards” tendency at 
x = Xq- 


dy 


dx 



It may be expected, therefore, that the slope at x = Xo on the first derived curve is negative. 
In other words, 


dry 

—— <0 at x = x 0 . 
dx 2 


This is the second derivative test for a local maximum. 
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EXAMPLE 


For the curve whose equation is 
we have 

d y 

dx 

The second derivative is negative everywhere, so it is certainly negative at the stationary 
point (0, 3) obtained in the previous method. Hence, (0, 3) is a local maximum. 

(b) Local Minima 


y = 3-x 2 , 


-2x and = —2. 
dx z 



As x passes from values below x 0 to values above x 0 , the corresponding values of ^ steadily 
increase from large negative values to large positive values, passing through zero when x = Xq. 

This suggests that the first derived curve exhibits a “going upwards” tendency at x = Xq. 



dy 


dx 



It may be expected, therefore, that the slope at x = Xq on the first derived curve is positive. 
In other words, 


d 2 y 
dx 2 


> 0 at x = xq. 


This is the second derivative test for a local minimum. 

EXAMPLE 


For the curve whose equation is 
we have 

dy 

dx 


y = x 2 — 2x + 3, 


2x — 2 and 


d 2 y 
dx 2 


2 . 


The second derivative is positive everywhere, so it is certainly positive at the stationary 
point (1, 2) obtained in the previous method. Hence, (1, 2) is a local minimum. 
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(c) Points of inflexion 



As x passes from values below x 0 to values above x 0 , the corresponding values of ^ appear 
to reach either a minimum or a maximum value at x = Xq. 



It may be expected, therefore, that the slope at x = xq on the first derived curve is zero and 
changes sign as x passes through the value xq. 


d 2 y 
dx 2 


0 at x 


x 0 and changes sign. 


This is the second derivative test for a point of inflexion. 
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EXAMPLE 


For the curve whose equation is 
we have 

d y 

dx 

The second derivative is zero when x = 0 and changes sign as x passes through the value 
zero. 

Hence, the stationary point (0, 5) found previously is a point of inflexion. 

Notes: 

(i) For a stationary point of inflexion, it is not enough that 


without also the change of sign. 

For example, the curve whose equation is 

4 

y = x 

is easily shown (by Method 1) to have a local minimum at the point (0, 0); and yet, for this 
curve, = 0 at x = 0. 



y = 5 + x 3 , 


d.^'27 

= 3a; 2 and — r = 6a; 
dar 



(ii) Some curves contain what are called “ordinary points of inflexion”. They are not 
stationary points and hence, ^ ^ 0; but the rest of the condition for a point of inflexion 
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still holds. That is, 


d 2 y 

dx 2 


= 0 and changes sign. 


EXAMPLE 


For the curve whose equation is 


y = x 3 + x, 


we have 


dy 

da: 


3a: 2 + 1 and 


d 2 y 
dx 2 


Hence, there are no stationary points at all; but 3-5 
passes through x = 0. 


: 6a:. 

0 at a; 


0 and changes sign as x 


That is, there is an ordinary point of inflexion at (0, 0). 



Notes: 

(i) In any interval of the rr-axis, the greatest value of a function of x will be either the 
greatest maximum or possibly the value at one end of the interval. Similarly, the least value 
of the function will be either the smallest minimum or possibly the value at one end of the 
interval. 

(ii) In sketching a curve whose maxima, minima and points of inflexion are known, it may 
also be necessary to determine, from the equation of the curve, its points of intersection with 
the axes of reference. 
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11.2.6 EXERCISES 


1. Determine the local maxima, local minima and points of inflexion (including ordinary 
points of inflexion) on the curves whose equations are given in the following: 

(a) 

y = x 3 — 6x 2 + 9x + 6; 


(b) 

1 

y — x -\ — . 
x 


In each case, give also a sketch of the curve. 

2. Show that the curve whose equation is 

v = 2JTT + 111(21 + 1( 

has a local minimum at a point on the y-axis. 

3. The horse-power, P , transmitted by a belt is given by 


where k is a constant, v is the speed of the belt, T is the tension on the driving side 
and w is the weight per unit length of the belt. Determine the speed for which the 
horse-power is a maximum. 

4. For x lying in the interval —3 < x < 5, determine the least and greatest values of the 
function 

a; 3 - 12a; + 20 
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11.2.7 ANSWERS TO EXERCISES 


1. (a) Local maximum at (1, 10), local minumum at (3,6), ordinary point of inflexion at 

(2,8); 



(b) Local maximum at (-1,-2), local minimum at (1,2). 



2. Local minimum at the point (0, 1). 

3. The horse-power is maximum when 

v = 

4. The greatest value is 85 at (5,85); the least value is 4 at (2,4). 
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UNIT 11.3 - DIFFERENTIATION APPLICATIONS 3 


CURVATURE 

11.3.1 INTRODUCTION 

In the discussion which follows, consideration will be given to a method of measuring the 
“tightness of bends” on a curve. This measure will be called “curvature” and its 
definition will imply that very tight bends have large curvature. 

We shall also need to distinguish between curves which are “concave upwards” (U), hav- 
ing positive curvature, and curves which are “concave downwards” (fl), having negative 
curvature. 

DEFINITION 

Suppose we are given a curve whose equation is y — f(x); and suppose that 6 is the angle 
made with the positive x-axis by the tangent to the curve at a point, P(x, y), on it. If s is the 
distance to P, measured along the curve from some fixed point, F, on it then the curvature, 
k, at P, is defined as the rate of increase of 6 with respect to s. 
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EXAMPLE 


Determine the curvature at any point of a circle with radius a. 

Solution 



We shall let A be a point on the circle at which the tangent is inclined to the positive x-axis 
at an angle, 6 , and let B be a point (close to A) at which the tangent is inclined to the 
positive x-axis at an angle, 6 + 56. The length of the arc, AB, will be called 5s, where we 
shall assume that distances, s, are measured along the circle in a counter-clockwise sense 
from the fixed point, F. 

The diagram shows that 56 is both the angle between the two tangents and the angle 
subtended at the centre of the circle by the arc, AB. 

Thus, 5s = a56 which can be written 


56 _ 1 
5s a 

Allowing 56, and hence 5s, to approach zero, we conclude that 

d 6 1 

« = t = -• 
ds a 

We note, however, that, for the lower half of the circle, 6 increases as s increases, while, 
in the upper half of the circle, 6 decreases as s increases. The curvature will therefore be 
positive for the lower half (which is concave upwards) and negative for the upper half (which 
is concave downwards). 
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Summary 

The curvature at any point of a circle is numerically equal to the reciprocal of the radius. 

11.3.2 CURVATURE IN CARTESIAN CO-ORDINATES 

Given a curve whose equation is y — f(x), suppose P (x,y) and Q(x + 5x, y + Sy) are two 
neighbouring points on it which are separated by a distance of 5s along the curve. 



In this diagram, we may observe that 


and also that 


dy 

da; 


tan# 

Sx^O OX 


d x 5x 

— = lim — = cos#, 
ds <5s^0 f)s 


The curvature may therefore be evaluated as follows: 


But, 


d e 

ds 


dd da; 

da; ds 


— . cosd. 

da; 


d 9 

da; 


d 

da; 


tan 1 


dy 

da; 


1 d 2 y 

i + {tf dx2 ' 
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Finally, 


cos 9 


1 

sec 9 


a/1 + tan 2 d 





and so, 


k — ± 


d 2 y 
dx 2 


_ 3 * 



Notes: 

(i) For a curve which is concave upwards at a particular point, the gradient, will increase 
as x increases through the point. Hence, will be positive at the point. 

(ii) For a curve which is concave downwards at a particular point, the gradient, will 
decrease as x increases through the point. Hence, will be negative at the point. 

(ii) In future, therefore, we may allow the value of the curvature to take the same sign as 
giving the formula 


k, = 


d 2 y 

dx 2 

1 + £1 1 


EXAMPLE 

Use the cartesian formula to determine the curvature at any point on the circle, centre (0, 0) 
with radius a. 

Solution 

The equation of the circle is 


2,2 2 
x y — ( i . 


which means that, for the upper half, 


4 



and, for the lower half, 


y 


= Va 2 — 


x- 


y 


= 


X 2 . 


Considering, firstly, the upper half, 


d y x 

da; y/a 2 -x 2 


and 


d 2 y _ x ' 2 + y/ a 2_ x 2 _ a 2 

dx 2 a 2 - x 2 (a 2 -a; 2 )i' 


Therefore, 


(a 2 — x 2 )^ ® 1 

a3_ 

a z —x z J 


For the lower half of the circle, 


k = 


1 

a 
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11.3.3 EXERCISES 


In the following questions, state your answer in decimals correct to three places of decimals: 

1. Calculate the curvature at the point (—1,3) on the curve whose equation is 

y — x + 3x 2 — x 3 . 

2. Calculate the curvature at the origin on the curve whose equation is 

9 

X — X 

y ~ 1 + x 2 ' 

3. Calculate the curvature at the point (1, 1) on the curve whose equation is 

x 3 — 2 xy + y 3 = 0. 

4. Calculate the curvature at the point for which 9 = 30° on the curve whose parametric 
equations are 

1 

x — 1 + sin 9 and y = sin 9 cos 29. 

y 2 

11.3.4 ANSWERS TO EXERCISES 

1. k — 0.023 

2. K = -0.707 

3. K = -5.650 

4. k = 0.179 
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UNIT 11.4 DIFFERENTIATION APPLICATIONS 4 


CIRCLE, RADIUS AND CENTRE OF CURVATURE 
11.4.1 INTRODUCTION 

At a point, P, on a given curve, suppose we were to draw a circle which just touches the 
curve and has the same value of the curvature (including its sign). This circle is called the 
“circle of curvature at P” . Its radius, p, is called the “radius of curvature at P” and 
its centre is called the “centre of curvature at P”. 
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11.4.2 RADIUS OF CURVATURE 


Using the earlier examples on the circle (Unit 11.3), we conclude that, if the curvature at P 
is k, then p = \ and, hence, 


P = 


ds 

d0‘ 


Furthermore, in cartesian co-ordinates, 


_ 3 



P= ^ 

dx 2 


Note: 

If we are interested in the radius of curvature simply as a length, then, for curves with 
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negative curvature, we would use only the numerical value obtained in the above formula. 
However, in a later discussion, it is necessary to use the appropriate sign for the radius of 
curvature. 

EXAMPLE 

Calculate the radius of curvature at the point (0.5, —1) of the curve whose equation is 

y 2 = 2x. 


Solution 

Differentiating implicitly, 


That is, 


Also 



= 2 . 


dy = 1 
dx y 


d 2 y Id y 1 

dx 2 y 2 dx y 3 

Hence, at the point (0.5, —1), ^ = — 1 and = 1. 

We conclude that 


P 


(l + l) f 
1 


2y/2. 


11.4.3 CENTRE OF CURVATURE 

We shall consider a point, ( x 0 ,y 0 ), on an arc of a curve whose equation is y = f(x) and for 
which the curvature is positive, the arc lying in the first quadrant. But it may be shown 
that the formulae obtained for the co-ordinates, ( x c ,y c ), of the centre of curvature apply in 
any situation, provided that the curvature is associated with its appropriate sign. 
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c (x c ,y c ) 



From the diagram, 


x c = x 0 - psin9, 
Vc = Vo + P cos 6. 


Note: 

Although the formulae apply in any situation, it is a good idea to sketch the curve in order 
estimate, roughly, where the centre of curvature is going to be. This is especially important 
where there is uncertainty about the precise value of the angle 6. 

EXAMPLE 

Determine the centre of curvature at the point (0.5, —1) of the curve whose equation is 

y 2 = 2x. 


Solution 

From the earlier example on calculating radius of curvature, 


d y 1 d 2 y 

— = - and — - 
da; y dor 


giving f x = 


-1, ^| = 1 and p = 2\/2 at the point (0.5, —1). 
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The diagram shows that the co-ordinates, ( x c ,y c ), of the centre of curvature will be such 
that x c > 0.5 and y c > — 1. This will be so provided that the angle, 9 , is a negative acute 
angle; (that is, its cosine will be positive and its sine will be negative). 

In fact, 


6 = tan 1 (— 1) = —45°. 


Hence, 


x c = 0.5 -2v / 2sin(— 45°), 
y c = -l + 2v / 2cos(-45°). 


That is, 


x c = 2.5 and y c — 1. 
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11.4.4 EXERCISES 


In the following questions, state your results in decimals correct to three places of decimals: 

1. Calculate the radius of curvature at the point (—1,3) on the curve whose equation is 

y = x + 3x 1 2 — x 3 

and hence obtain the co-ordinates of the centre of curvature. 

2. Calculate the radius of curvature at the origin on the curve whose equation is 

9 

X ~ X 

y = l + x 2 

and hence obtain the co-ordinates of the centre of curvature. 

3. Calculate the radius of curvature at the point (1, 1) on the curve whose equation is 

a; 3 — 2 xy + y A = 0 

and hence obtain the co-ordinates of the centre of curvature. 

4. Calculate the radius of curvature at the point for which 9 = 30° on the curve whose 
parametric equations are 

1 

x — 1 + sin 6 and y = sin 9 — - cos 29 
and hence obtain the co-ordinates of the centre of curvature. 

11.4.5 ANSWERS TO EXERCISES 

1 . p = 43.6705, (x c , y c ) = (42.333, 8.417). 

2. p = -1.414 (x c ,y c ) = (1,-1). 

3. p = -0.177 (x c ,y c ) = (0.875,0.875). 

4. p = 0.590 (x c ,y c = (-3.500,2.750). 
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UNIT 11.5 - DIFFERENTIATION APPLICATIONS 5 


MACLAURIN’S AND TAYLOR’S SERIES 
11.5.1 MACLAURIN’S SERIES 

One of the simplest kinds of function to deal with, in either algebra or calculus, is a polyno- 
mial (see Unit 1.8). Polynomials are easy to substitute numerical values into and they are 
easy to differentiate. 

One useful application of the present section is to approximate, to a polynomial, functions 
which are not already in polynomial form. 

THE GENERAL THEORY 

Suppose f(x ) is a given function of x which is not in the form of a polynomial, and let us 
assume that it may be expressed in the form of an infinite series of ascending powers of x ; 
that is, a “power series”, (see Unit 2.4). 

More specifically, we assume that 

f(x ) = oo + a\x + CL 2 X 2 + CI 3 X 3 + 04a; 4 + . . . 

This assumption cannot be justified unless there is a way of determining the “coefficients”, 
a 0 , ai, d2, d3, d4, etc.; but this is possible as an application of differentiation as we now show: 

(a) Firstly, if we substitute x = 0 into the assumed formula for f(x), we obtain /( 0) = a 0 ; 
in other words, 


d 0 = /( 0). 

(b) Secondly, if we differentiate the assumed formula for f(x) once with respect to x, we 
obtain 

f'(x ) = di + 2 d 2 X + 3d3a; 2 + 4d4X 3 + . . . 
which, on substituting x = 0, gives /'( 0) = dp in other words, 

«i = /'( 0). 
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(c) Differentiating a second time leads to the result that 


f"(x ) = 2a 2 + (3 x 2)030; + (4 x 3)a4m 2 + . . . 
which, on substituting x = 0 gives f"( 0) = 2 a 2 ; in other words, 

«2 = )/"(0). 

(d) Differentiating yet again leads to the result that 

f”\x) = (3 x 2)a 3 + (4 x 3 x 2 )a 4 x + . . . 
which, on substituting x = 0 gives f"\ 0) = (3 x 2)a 3 ; in other words, 

% = 4/'"(o). 

(e) Continuing this process with further differentiation will lead to the general formula 

On = V'”’* 0 *' 

Tl\ 

where /^(0) means the value, at x = 0 of the n-tii derivative of f(x). 

Summary 

m = /( 0 ) + xf\o) + p>(0) + (o) + . . . 

This is called the “Maclaurin’s series for f(x) n . 

Notes: 

(i) We must assume, of course, that all of the derivatives of f(x) exist at x = 0 in the first 
place; otherwise the above result is invalid. 

It is also necessary to examine, for convergence or divergence, the Maclaurin’s series obtained 
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for a particular function. The result may not be used when the series diverges; (see Units 
2.3 and 2.4). 

(b) If x is small and it is possible to neglect powers of x after the n-tli power, then Maclaurin’s 
series approximates f(x) to a polynomial of degree n. 

11.5.2 STANDARD SERIES 

Here, we determine the Maclaurin’s series for some of the functions which occur frequently 
in the applications of mathematics to science and engineering. The ranges of values of x for 
which the results are valid will be stated without proof. 


1. The Exponential Series 

(i) f(x) = e x -, 

(ii) f(x) = e x \ 

(iii) f"(x) = e x \ 

(iv) f"{x) = e x \ 

(v) f( w )( x ) = e x ; 

Thus, 


hence, /( 0) = e° = 1. 
hence, /'( 0) = e° = 1. 
hence, /"( 0) = e° = 1. 
hence, /'"( 0) = e° = 1. 
hence, f^ lv \ 0) = e° = 1. 


2 S 4 

CC'* 

— 1 ~ ' r ~ 2! ~ 3! ~;41~ 


and it may be shown that this series is valid for all values of x. 


2. The Sine Series 

(i) f(x ) = sin re; 

(ii) fix) = cosrc; 

(iii) f"(x) = — sin re; 

(iv) f"'(x) = — cosrr; 

( v ) f( wS >(x) = sin re; 

(vi) f( v \x) = cosrr; 
Thus, 


hence, /( 0) = sinO = 0. 
hence, /'( 0) = cosO = 1. 
hence, /"( 0) = — sinO = 0. 
hence, f"{ 0) = — cosO = — 1. 
hence, 0) = sinO = 0. 
hence, /^( 0) = cosO = 1. 


^ ‘a 

sin x — x - — + — 


and it may be shown that this series is valid for all values of x. 
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3. The Cosine Series 

(i) f(x) = cos a;; 

(ii) f(x) = — sin a;; 

(iii) f"{x) = — cos a;; 

(iv) f'"(x) = sin a;; 

( v ) f(i v )(x) = cos a:; 
Thus, 


hence, /( 0) = cosO = 1. 
hence, /'( 0) = — sinO = 0. 
hence, f"{ 0) = — cosO = —1. 
hence, /'"( 0) = sinO = 0. 
hence, f iv \ 0) = cosO = 1. 


cos a; = 1 


2 4 

X X 

2\ + 4l 


and it may be shown that this series is valid for all values of x. 

The Logarithmic Series 

It is not possible to find a Maclaurin’s series for the function In a;, since neither the 
function nor its derivatives exist at x = 0. 

As an alternative, we may consider the function ln(l + x) instead. 


(i) f(x) = ln(l + a;); 

(ii) f{x) = 

(iii) f”{x) = 

(iv) f"(x) = 

(v) f iv \x) = 2x3 

Thus, 


(l+x ) 4 


hence, /( 0) = In 1 = 0. 
hence, /'( 0) = 1. 
hence, /"( 0) = 1. 
hence, f"{ 0) = 2. 
hence, 0) = —(2x3). 


2 3 4 

/p* nr* ^ 

- / \ tv tA/ / \ tv 

ln(l + x) = x - 2 j- + 2 * gj- - ( 2 x 3 )^j- + 


which simplifies to 


/p" /p o /p a 

. / . \ tv IV IV 

ln(l + a; ) = a; -y + y- ^: + ••• 

and it may be shown that this series is valid for — 1 < x < 1. 

5. The Binomial Series 

The statement of the Binomial Formula has already appeared in Unit 2.2; and it was 
seen there that 

(a) When n is a positive integer, the expansion of (1 + x) n in ascending powers of x is 
a finite series; 
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(b) When n is a negative integer or a fraction, the expansion of (1 + x) n in ascending 
powers of x is an infinite series. 

Here, we examine the proof of the Binomial Formula. 

(i) f(x) = (1 + x) n ; hence, /( 0) = 1. 

(ii) f'(x) = n( 1 +x) n ~ 1 ; hence, /'( 0) = n. 

(iii) f"(x) = n(n — 1)(1 + x) n ~ 2 ; hence, /"( 0) = n(n — 1). 

(iv) f"'(x) = n(n — l)(n — 2)(1 + x) n ~ z \ hence, /'"( 0) = n(n — l)(n — 2). 

( v ) f( w \x) = n(n — l)(n — 2)(n — 3)(1 + x) n_4 ; hence, f^ v \ 0) = n(n — l)(n — 2)(n — 3). 
Thus, 


(1 + x) n = 1 +nx + 


n(n 


1 ) 


2 ! 


2 n(n — l)(n — 2) o n(n — l)(n — 2)(n — 3) 4 
x 2 + — V — -x* + 


3! 


4! 


If n is a positive integer, all of the derivatives of (1 + x) n after the n-th derivative are 
identically equal to zero; so the series is a finite series ending with the term in x n . 

In all other cases, the series is an infinite series and it may be shown that it is valid 
whenever — 1 < x < 1. 


EXAMPLES 


1. Use the Maclaurin’s series for sin x to evaluate 


x + sm x 

inn — . 

x(x + 1) 


Solution 

Substituting the series for sin x gives 


lim 


X + X — " 3 !"+ 5 !" — 

X 2 + X 


= lim 

x — >0 


O/r* _L_ _ 

« ' 120 


X 2 + X 


r\ xr_ , x * _ 

= lim — — — — 120 = 2 . 

x^0 x + 1 
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2. Use a Maclaurin’s series to evaluate a/1.01 correct to six places of decimals. 

Solution 

We shall consider the expansion of the function (1 + x)^ and then substitute x = 0.01. 


(1 + x) 2 = 1 + -x + 


i , (DHL. (l) (-D (-D 


2 ! 


-x + 


3! 


x* + ... 


That is, 


/ . i 1 i i 

(1 + x)2 = 1 + -X - -X + —x + • • • 
z o lo 


Substituting x = 0.01 gives 


vToI = 1 + - x 0.01 - - x 0.0001 + — x 0.000001 
2 8 16 


= 1 + 0.005 - 0.0000125 + 0.0000000625 - . . . 

The fourth term will not affect the sixth decimal place in the result given by the first 
three terms; and this is equal to 1.004988 correct to six places of decimals. 

3. Assuming the Maclaurin’s series for e x and sin x and assuming that they may be multi- 
plied together term-by-term, obtain the expansion of e^sinx in ascending powers of x 
as far as the term in x 5 . 

Solution 


x~ 


x° 


X 


e x sin x = ( 1 + a: + ^ ! +^[ + / ! + 


X — r T — — T 

, 3! 120 


q c; 

CC'~' 

x ~j + m 


x 


or 


x 


x 




X ~J + ^~W + J + 2i + 


X 


X 


= x + x H b ... 

3 30 


11.5.3 TAYLOR’S SERIES 

A useful consequence of Maclaurin’s series is known as Taylor’s series and one form of it may 
be stated as follows: 
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Proof: 


f(x + h) = f(h ) + xf\h) + ^f(h) + | \f"{h) + ... 


To obtain this result from Maclaurin’s series, we simply let f(x + h ) = F(x). Then, 

F(x) = F( 0) + xF\ 0) + ^F"(0) + ~ F"'(0) + . . 

But, F(0) = f(h), F'(0) = f'{h ), F"(0) = f"(h), F'"( 0) = f'"(h),. . . which proves the 
result. 

Note: An alternative form of Taylor’s series, often used for approximations, may be obtained 
by interchanging the symbols x and h to give 


f(x + h) = f(x) + hf(x ) + ^ 7 "(>) + + • • • 


EXAMPLE 

Given that sin | = cos | use Taylor’s series to evaluate sin(a: + h ), correct to five 

places of decimals, in the case when x — ^ and h = 0.01. 

Solution 

Using the sequence of derivatives as in the Maclaurin’s series for sin a;, we have 

.... h 2 . h 3 

sm (a; + h ) = sm x + n cos x — — sm x — — cos x + 

O . 

Subsituting x — | and h = 0.01, we obtain 


sin 




1 

V2 


^1 + 0.01 - 


(0.01) 2 

2 ! 


(0.01) 3 

3! 


+ . . . 



+ 0.01 - 0.00005 - 0.000000017 + ) 
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The fourth term does not affect the fifth decimal place in the sum of the first three terms; 
and so 


sin 



+ 0.01 



x 1.00995 ~ 0.71414 


11.5.4 EXERCISES 


1. Determine the first three non-vanishing terms of the Maclaurin’s series for the function 
secx. 


2. Determine the Maelaurin’s series for the function tana: as far as the term in x 5 . 

3. Determine the Maelaurin’s series for the function ln(l + e x ) as far as the term in x 4 . 

4. Use the Maclaurin’s series for the function e x to deduce the expansion, in ascending 
powers of x of the function e~ x and then use these two series to obtain the expansion, 
in ascending powers of x, of the functions 

(a) 


(b) 


e x + e x 

2 


cosh a:); 



sinhx). 


5. Use the Maclaurin’s series for the function cos a; and the Binomial Series for the function 
to obtain the expansion of the function 


cos a; 
1 + x 


in ascending powers of x as far as the term in x 4 . 

6. From the Maclaurin’s series for the function cos x, deduce the expansions of the functions 
cos 2a; and sin 2 a; as far as the term in x 4 . 



7. Use appropriate Maclaurin’s series to evaluate the following limits: 


(a) 

e * + e -x _ 2 

2 cos 2x — 2 


(b) 

lim 


sirrx — x 2 cos x 

X 4 


lim 

x —>0 


8. Use a Maclaurin’s series to evaluate 3 \/l.05 correct to four places of decimals. 

9. Expand cos(x + h ) as a series of ascending powers of h. 

Given that sin | = \ and cos | = 7^, evaluate cos(a: + h ), correct to five places of 
decimals, in the case when x = 5 and h = —0.05. 

D 


11.5.5 ANSWERS TO EXERCISES 


1. 


2 . 


3. 


4. (a) 


(b) 


2 4 

„ X X 

1 + y + v + 


X + -r* + 


2x 5 

15 


x 


In 2 H 1- 

2 


a; 2 

¥ 



+ . . . 


2 4 

X X 

cosh a; = l + w7 + Tr + 
2! 4! 


0C'~ J X 

sinh x — x 4 , 4 + 

3! 5! 
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5. 


x 2 

!-*+- 


x 3 13a; 4 
2 + ~24T 


6 . 


cos 2x 


1-2x 2 + 


2x 4 

~3~ 


. o 2 
sin x = x 



7. (a) -f, (b) l 

8. 1.0164 

9. 0.74156 
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UNIT 11.6 - DIFFERENTIATION APPLICATIONS 6 


SMALL INCREMENTS AND SMALL ERRORS 
11.6.1 SMALL INCREMENTS 

Given that a dependent variable, y, and an independent variable, x are related by means of 
the formula 


V = f(x), 

suppose that x is subject to a small “increment”, Sx, 

In the present context we use the term “increment” to mean that Sx is positive when x is 
increased, but negative when x is decreased. 

The exact value of the corresponding increment, Sy, in y is given by 


5y = f(x + Sx) - f(x), 

but this can often be a cumbersome expression to evaluate. 

However, since Sx is small, we may recall, from the definition of a derivative (Unit 10.2), 
that 


f(x + Sx) - f(x) ~ d y 
Sx da; 


That is, 


Sy d y 

• 

Sx da;’ 


and we may conclude that 


Sy ~ 


dy 

da; 


Sx. 


For a diagramatic approach to this approximation for the increment in y, let us consider the 
graph of y against x in the neighbourhood of the two points P(a;, y) and Q(x + Sx, y + Sy) 
on the curve whose equation is y — f[x). 


1 




In the diagram, PR = Sx, QR = Sy and the gradient of the line PS is given by the value of 
^ at P. 

ax 

Taking SR as an approximation to QR, we obtain 


In other words, 


Hence, 


SR 

PR 


d y 

da; 


SR 

Sx 


dy 

da; 


J p 


Sy ~ 


dy 

da; 


Sx, 


which is the same result as before. 

Notes: 

(i) The quantity ^ Sx is known as the “total differential of y” (or simply the “differential 
of y”). It provides an approximation (including the appropriate sign) for the increment, 
Sy, in y subject to an increment of Sx in x. 
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(ii) It is important not to use the word “differential” when referring to a “derivative”. 
Rather, the correct alternative to “derivative” is “differential coefficient” . 

(iii) A more rigorous approach to the calculation of Sy is to use the result known as “Taylor’s 
Theorem” (see Unit 11.5) which, in this context, would give the formula 

f(x + Sx) = f(x) + f\x)Sx + ^(Sx) 2 + J (Sx) 3 + . . . . 

Hence, if Sx is small enough for powers of two and above to be neglected, then 

f(x + Sx) — f(x) ~ f'(x)Sx 

to the first order of approximation. 

EXAMPLES 

1. If a square has side rcms., determine both the exact and the approximate values of the 
increment in the area Herns 2 . when x is increased by Sx. 

Solution 

(a) Exact Method 

Sx 
x 


The area is given by the formula 

A = x 2 . 

If x increases by Sx, then the increase, SA, in A may be obtained from the formula 
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A + 5 A = (x + 5x) 2 = x 2 + 2xSx + (Sx) 2 . 


That is, 


SA = 2xSx + (5x) 2 . 


(b) Approximate Method 

Here, we use 


to give 


dA 

da; 


= 2x 


5 A ~ 2xSx; 


and we observe from the diagram that the two results differ only by the area of the 
small square, with side Sx. 


calculate, approximately, the change in y when x increases from 5 to 5.03. 

Solution 

We have 


dy 

da; 


e X (1 - a;), 


so that 


5y ~ e *(1 — x)5x, 


where x = 5 and 5x = 0.3. 
Hence, 


5y ~ e“ 5 .(l - 5). (0.3) ~ -0.00809, 
showing a decrease of 0.00809 in y. 

We may compare this with the exact value which is given by 


Sy = 5.3e -5 ' 3 - 5e" 5 ~ -0.00723 


4 



3. If 


y = xe , 

determine, in terms of x, the percentage change in y when x is increased by 2%. 

Solution 

Once again, we have 


Sy = e X (1 — x)5x; 

but, this time, Sx = 0.02a;, so that 

Sy = e~ x (l — x) x 0.02a;. 

The percentage change in y is given by 

Sy e~ x (l — x) x 0.02a; . . 

— x 100 = x 100 = 2 1 - x . 

y xe x 

That is, y increases by 2(1 — x)%, which will be positive when x < 1 and negative when 
x > 1. 


Note: 

It is usually more meaningful to discuss increments in the form of a percentage, since this 
gives a better idea of how much a variable has changed in porportion to its original value. 

11.6.2 SMALL ERRORS 

In the functional relationship 


y = f(x), 


let us suppose that x is known to be subject to an error in measurement; then we consider 
what error will be likely in the calculated value of y. 

In particular, suppose x is known to be too large by a small amount, Sx. in which case the 
correct value of x could be obtained if we decreased it by Sx: or, what amounts to the same 
thing, if we increased it by —Sx. 

Correspondingly, the value of y will increase by approximately — (jy&r; that is, y will 
decrease by approximately ^ Sx . 
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Summary 

We conclude that, if x is too large by an amount Sx, then y is too large by approximately 
^<5a;; though, ofcourse, if ^ itself is negative, y will be too small when x is too large and 
vice versa. 

EXAMPLES 

1. If 

y = x 2 sin a;, 

calculate, approximately, the error in y when x is measured as 3, but this measurement 
is subsequently discovered to be too large by 0.06. 

Solution 

We have 

d y 2 

— = x cos x + 2x sm x 
ax 

and, hence, 


5y ~ ( x 2 cos x + 2x sin x)5x, 


where x = 3 and Sx = 0.06. 

The error in y is therefore given approximately by 

Sy — (3 2 cos 3 + 6 sin 3) x 0.06 ~ —0.4838 
That is, y is too small by approximately 0.4838. 


determine approximately, in terms of x, the percentage error in y when x is subject to 
an error of 5%. 

Solution 

We have 

d y 1 + x — x 1 

da; (1 + a;) 2 (1+a;) 2 ’ 

so that 
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5y ~ 


1 

(1 + x) 2 


5x, 


where Sx = 0.05a:. 

The percentage error in y is thus given by 

5y 1 x + 1 5 

— x 100 ~ t T77 x 0.05a: x x 100 = . 

y (1 + x) 2 x 1 + x 

Hence, y is too large by approximately YY% which will be positive when x > — 1 and 
negative when x < — 1. 

11.6.3 EXERCISES 


1. If 


y = 


J2x 


X 


calculate, approximately, the change in y when x is increased from 1 to 1.0025. 

State your answer correct to three significant figures. 

2. If 

y = (2a: + l) 4 5 , 

determine approximately, in terms of x , the percentage change in y when x increases 
by 0.1%. 


3. If 


y = x 3 In x , 


calculate approximately, correct to the nearest integer, the error in y when x is measured 
as 4, but this measurement is subsequently discovered to be too small by 0.12. 


4. If 


y = cos(3a: 2 + 2), 

determine approximately, in terms of x , the percentage error in y if x is too large by 


You may assume that 3ar + 2 lies between 7 r and ^ . 
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11.6.4 ANSWERS TO EXERCISES 

1. y increases by approximately 0.0185. 

2. y increases by approximately ( 2 x+i) % 

3. y is too small by approximately 10. 

4. y is too small by approximately — 12a; 2 tan(3a; 2 + 2). 
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UNIT 12.1 - INTEGRATION 1 - ELEMENTARY INDEFINITE INTEGRALS 


12.1.1 THE DEFINITION OF AN INTEGRAL 


In Differential Calculus, we are given functions of x and asked to obtain their derivatives; 
but, in Integral Calculus, we are given functions of x and asked what they are the derivatives 
of. The process of answering this question is called “integration” . 

In other words integration is the reverse of differentiation. 


DEFINITION 

Given a function fix), another function, z, such that 


d£ 

da; 


= fix) 


is called an integral of f(x) with respect to x. 


Notes: 

(i) The above definition refers to an integral of f(x) rather than the integral of f(x). This 
is because, having found a possible function, z, such that 


dz 

da; 


fix), 


z + C is also an integral for any constant value, C . 


(ii) We call z + C the “indefinite integral of f(x) with respect to a;” and we write 


J f(x) dx = z + C. 


(iii) C is an arbitrary constant called the “constant of integration” . 

(iv) The symbol da; does not denote a number; it is to be taken as a label indicating the 
variable with respect to which we are integrating. It may seem obvious that this will be x, 
but it could happen, for instance, that x is already dependent upon some other variable, t, in 
which case it would be vital to indicate the variable with respect to which we are integrating. 

(v) In any integration problem, the function being integrated is called the “integrand”. 
Result: 

Two functions Z\ and Z2 are both integrals of the same function fix) if and only if they differ 
by a constant. 
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Proof: 

(a) Suppose, firstly, that 

zi-z 2 = C, 

where C is a constant. 

Then, 

That is, 

dzi dz 2 _ 

da; da; 
or 

dz\ dz 2 
da; da; 

From our definition, this shows that both z\ and z 2 are integrals of the same function. 

(b) Secondly, suppose that z\ and z 2 are integrals of the same function. Then 

dz\ dz 2 
da; da; 


That is, 


dzt dz 2 
— = 0 , 

da; da; 


Hence, 


where C may be any constant. 




Z!~Z2 = C, 


ILLUSTRATIONS 

Any result so far encountered in differentiation could be re-stated in reverse as a result on 
integration as shown in the following illustrations: 


3a; 2 da; = ar + C. 


a; 2 da; = — + C. 
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3 . 


4. 


5. 



x n+l 

n + 1 


+ C Provided n^-1. 



i.e. 



In x + C. 


J e x dx = e x + C. 


6 . 

J cos xda; = sin a: + C. 

7. 

J sin icda; = — cos x + C. 


Note: 

Basic integrals of the above kinds may simply be quoted from a table of standard integrals 
in a suitable formula booklet. More advanced integrals are obtainable using the rules to be 
discussed below. 

12.1.2 ELEMENTARY TECHNIQUES OF INTEGRATION 
(a) Linearity 

Suppose f{x) and g(x) are two functions of x while A and B are constants. Then, 

J [Af(x) + Bg(x)]dx = A J f(x)dx + B J g(x) dx. 


The proof follows from the fact that differentiation is already linear and hence the derivative 
of the right hand side is the integrand of the left hand side. The result itself is easily extended 
to linear combinations of three or more functions. 

ILLUSTRATIONS 

1 . 

/ (x 2 + 3x- 7)dx = ^ + 3^ - 7 x + C. 

J O Zj 
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2 . 


3 cos x + 4sec 2 a;)da; = 3 sin x + 4 tan x + C. 


(b) Functions of a Linear Function 
(i) Inspection Method 

Provided the method of differentiating functions of a linear function has been fully under- 
stood, the fastest method of integrating such functions is to examine, by inspection, what 
needs to be differentiated in order to arrive at them. 

EXAMPLES 


1. Determine the indefinite integral 

J (2x + 3) 12 da;. 

Solution 

In order to arrive at the function (2x + 3) 12 by a differentiation process, we would have 
to begin with a function related to (2a; + 3) 13 . 

In fact, 

— \(2x + 3) 13 ] = 13(2x + 3) 12 .2 = 26(2a; + 3) 12 . 
da; L J 

Since this is 26 times the function we are trying to integrate, we may say that 

/ (2x + 3) 12 — + C. 

2. Determine the indefinite integral 

J cos(3 — 5a;)da;. 


Solution 

In order to arrive at the function cos (3 — 5a;) by a differentiation process, we would have 
to begin with a function related to sin(3 — 5a;). 

In fact, 

— [sin(3 — 5a;)] = cos(3 — 5a;). — 5 = — 5cos(3 — 5a;). 

Since this is —5 times the function we are trying to integrate, we may say that 


J cos(3 — 5a;) 


sin(3 — 5a;) 
5 


+ C. 
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3. Determine the indefinite integral 


e 4x+1 dx. 


Solution 

In order to arrive at the function e 4x+1 by a differentiation process, we would have to 
begin with a function related to e 4x+1 itself because the derivative of a power of e always 
contains the same power of e. 

In fact 


4x+l 


dx L 


= e 


4x-\-l 


.4. 


Since this is 4 times the function we are trying to integrate, we may say that 


i 4x+1 dx = 


d 4x-\-1 


+ c. 


4. Determine the indefinite integral 

f — dx. 

J 7x + 3 


Solution 


In order to arrive at the function yy— ^ by a differentiation process, we would have to 
begin with a function related to ln(7a; + 3). 


In fact, 


^[ln(7x + 3)] 


7x + 3 


-.7 


7 

7x + 3 


Since this is 7 times the function we are trying to integrate, we may say that 


[ — i — dx — ln(7x + 3) + c. 
J 7x + 3 7 


Note: 

In each of these examples, we are essentially treating the linear function as if it were a single 
x, then dividing the result by the coefficient of x in that linear function. 

(ii) Substitution Method 

The method to be discussed here will eventually be applied to functions other than functions 
of a linear function; but the latter serve as a useful way of introducing the technique of 

“Integration by Substitution”. 

In the integral of the form J f(ax + b)dx, we may substitute u = ax + b proceeding as follows: 


5 



Suppose 

z = j f(ax + b) da;. 

Then, 

t = f(ax+b) - 

That is, 

P = /(«). 

da; 

But, 

d^ d^ da; da; 

d u da;’du ' du 

Hence, 

" = / /M s d “' 

Note: 



The secret of this integration by substitution formula is that, apart from putting u = ax + b 
into f(ax + b ), we replace the symbol da; with almost as if we had divided da; by d u 

then immediately multiplied by it again, though, strictly, this would not be allowed since da; 
and d u are not numbers. 

EXAMPLES 
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2. Determine the indefinite integral 


z = 


cos(3 — 5a:)da:. 


Solution 

Putting u 
Thus, 


3 — 5a: gives ^ = — 5 and hence ^ = — |. 


z = 


cos u. d u 

5 


1 

5 


sin u + C. 


That is, 


as before. 



sin(3 — 5a:) + C, 


3. Determine the indefinite integral 

z = J e ix+l Ax. 


Solution 

Putting u — 4a: + 1 gives ^ 
Thus, 


4 and, hence, )!' x = 



4 


+ C. 


That is, 


as before. 


z = 


g4x+l 

4 


+ C, 


4. Determine the indefinite integral 


z = 


1 

7x + 3 


da:. 


Solution 

Putting u = 7x + 3 gives ^ = 7 and, hence, ^ = 4 
Thus, 

/• 1 1 1 

z — / -.-du—-]nu + C. 
J u 7 7 

That is, 

z = - ln(7a: + 3) + C, 

as before. 
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12.1.3 EXERCISES 


1. Integrate the following functions with respect to x: 
(a) 


(b) 


(d) 


x 


3 

x 2 : 


x v 


2ar — x + 5; 


a: 3 — 7x 2 + x + 1. 

2. Use a substitution of the form u = ax + 6 in order to determine the following integrals: 


(b) 


(c) 


sin(5x — 6)da;; 


J e 2x+11 dx; 

J (3x + 2) 6 da;. 


3. Write down, by inspection, the indefinite integrals with respect to x of the following 
functions: 


(1 + 2a;) 10 ; 



(b) 


e 


12x+4. 

S 


(c) 


(d) 

(e) 


(f) 


1 

3x - 1 ’ 


sin (3 — 5a;); 


9 

(4 -a;) 5 ’ 


cosec 1 2 (7a; + 1). 


12.1.4 ANSWERS TO EXERCISES 


1. (a) 


(b) 


(c) 


(d) 


(e) 


6 


+ C; 


2 

5 


x 2 + C; 


1 

5a; 5 


+ 




+ 5a; + C; 


1 4 7 3 1 2 

-X - -x + -x + X + C. 

4 3 2 


9 



(b) 


cos(5a: — 6) + C; 


1 2x+ll 

2 


+ C; 


(c) 


3. (a) 


(b) 


(c) 


(d) 


(e) 


±(3 x + 2) 7 + C. 

— (1 + 2 a ;) 11 + C ; 


_1 12x+4 

12 


+ Cj 


tt ln(3x - 1) + C; 

o 

- cos(3 — 5x) + C: 
5 


9 

4(4 — a:) 4 


+ C; 


(f) 


- cot(7a; + 1) + C. 



APPENDIX - A Table of Standard Integrals 


f(x) 

I f(x) da; 

a (const.) 

ax 

x n 

x n+1 / (n + 1) n / —1 

1/x 

In a; 

sin ax 

— (1/a) cos ax 

cos ax 

(1/a) sin ax 

sec 2 ax 

(1/a) tan aa; 

cosec 2 ax 

— (1/a) cot ax 

sec ax. tan ax 

(l/a)secax 

cosec ax. cot ax 

— (l/a)cosecaa; 

g ax 

(1 /a)e ax 

a x 

a x / In a 

sinh ax 

(1/a) cosh ax 

cosh ax 

(1/a) sinh ax 

sech 2 ax 

(1/a) tanh ax 

sech ax. tanh ax 

— (l/a)sechaa; 

cosech ax. coth ax 

— (l/a)cosechaa; 

cot ax 

(1/a) ln(sin ax) 

tan ax 

— (1/a) ln(cos ax) 

tanh ax 

(1/a) ln(cosh ax) 

coth ax 

(1/a) ln(sinh ax) 

i/V(° 2 - x 2 ) 

sin” 1 (x/a) 

r l/(a 2 + x 2 ) 

(l/a)tan _1 (a;/a) 

W(^ 2 + a 2 ) 

sinh -1 (x/a) 
or ln(a; + \/ x 1 + a 2 ) 

1 Nix 1 - a 2 ) 

cosh -1 (x/a) 
or ln(a; + \/x 2 — a?) 

l/(a 2 — x 2 ) 

(l/a)tanh -1 (a:/a) 
or d- In ( ^ ) when a; < a, 

2 a \a—x ) ii* 

j-ln(^+“) when \x > a. 

2a \ x—a ) 1 1 


11 
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UNIT 12.2 - INTEGRATION 2 
INTRODUCTION TO DEFINITE INTEGRALS 

12.2.1 DEFINITION AND EXAMPLES 

So far, all the integrals considered have been “indefinite integrals” since each result has 
contained an arbitrary constant which cannot be assigned a value without further informa- 
tion. 

In practical applications of integration, however, a different kind of integral, called a “defi- 
nite integral” , is encountered and is represented by a numerical value rather than a function 
plus an arbitrary constant. 

Suppose that 

j f(x) dx = g(x) + C. 


Then the symbol 

[ f{x)dx 

J a 

is used to mean 

(Value of g(x) + C at x — b ) minus (Value of g(x) + C at x — a). 
In other words, since C will cancel out, 



g{b) — g(a). 


The right hand side of this statement can also be written 

W)t > 

a notation which is used as the middle stage of a definite integral calculation. 

The values a and b are known as the “lower limit” and “upper limit”, respectively, of 
the definite integral (even when a is larger than b). 

EXAMPLES 

1. Evaluate the definite integral 

7 r 

/ - cosaxhr. 

Jo 


1 



Solution 


7 r 


cos a; da: = [sinxjo = sin — 


sin 0 = 1. 


2. Evaluate the definite integral 



(2x + l) 2 da:. 


Solution 

Using the method of integrating a function of a linear function, we obtain 



(2x + l) 3 


Z! 

6 


3 3 

— ~ 52.67 
6 


Notes: 

(i) If we had decided to multiply out the integrand (2a: + l) 2 before integrating, giving 

4a: 2 + 4x + 1, 

the integration process would have yielded the expression 

a: 3 

4— + 2x 2 + x, 
o 

which differs only from the previous result by the constant value students may like to 
check this. Hence the numerical result for the definite integral will be the same. 

(ii) An alternative method of evaluating the definite integral would be to make the substi- 
tution 

u = 2x + 1. 

But, whenever substitution is used for definite integrals, it is not necessary to return to the 
original variable at the end as long as the limits of integration are changed to the appropriate 
values for u. 


Replacing da: by fUlw (that is, ^d u) and the limits x — 1 and i = 3by« = 2xl + l = 3 
and w = 2x3 + l = 7, respectively, we obtain 


r 7 9I 

u 3 

/ ar-dw = 

h 2 



7 s 

6 


3 3 

— ~ 52.67 

6 


2 



12.2.2 EXERCISES 


Evaluate the following definite integrals: 


1 . 


2 . 


3. 


4. 


5. 


sin 4xdx. 



(x + l) 7 dx. 



sec 2 (a; + n) dx. 



1 

4 + 3x 


dx. 



e 1 ” 7 * dx. 


12.2.3 ANSWERS TO EXERCISES 


1 . 

2 . 


0.375 


32. 


3. 


4. 


5. 


1 . 


-In — or 0.119 approximately 
3 7 


0.388 approximately 


3 



“JUST THE MATHS” 


UNIT NUMBER 
12.3 

INTEGRATION 3 

(The method of completing the square) 



A. J. Hobson 


12.3.1 Introduction and examples 

12.3.2 Exercises 

12.3.3 Answers to exercises 




UNIT 12.3 - INTEGRATION 3 

THE METHOD OF COMPLETING THE SQUARE 

12.3.1 INTRODUCTION AND EXAMPLES 

A substitution such as u = ax + (3 may also be used with integrals of the form 

[ — 77 — da; and / , 0 1 ^=da;, 

J px z + qx + r J y/px z + qx + r 

where, in the first of these, we assume that the quadratic will not factorise into simple linear 
factors; otherwise the method of partial fractions would be used to integrate it (see Unit 
12 . 6 ). 

Note: 

The two types of integral here are often written, for convenience, as 


da; 

px 2 + qx + r 


and 


da; 

y/px 2 + qx + r 


In order to deal with such functions, we shall need to quote standard results which may 
be deduced from previous ones developed in the differentiation of inverse trigonometric and 
hyperbolic functions. 

They are as follows: 


1. 


2 . 


3. 


4 . 



1 

-tan — 1 - C. 
a a 


I 



da; = sin x — 


x 


+ C. 


[ —— _ 
J V a 2 + x 2 


da; 


rjQ 

sinh -1 — hd or ln(x + \/ x 2 + a 2 ) + C. 
a 



cosh + C or ln(a; + \j x 2 — a 2 ) + C. 
a 


1 



5 . 


or 


and 


EXAMPLES 


/ 


1 


a? — x 2 


da; 


1 , -l x 

-tanh 1 — hC; 
a a 


1 , fa 4- x\ , . . 

— m + C when m < a, 

2 a \a — xj 


— In ( ] + C when Ixl > a. 

2 a \x — a/ 


1. Determine the indefinite integral 


z = 


da; 

\/a; 2 + 2a; — 3 


Solution 

Completing the square in the quadratic expression gives 

x 2 + 2a; — 3 = (x + l) 2 — 4 = (x + l) 2 — 2 2 . 


Hence, 


Putting u = x + 1 gives ^ 
Thus, 


giving 


z = 


da; 

(x + l) 2 — 2 2 


1; and so 4^ = 1. 


z = 


d u 

Vu 2 - 2 2 ’ 


2 = In 


71 


+ Vw 2 - 2 2 " 


+ C. 


Returning to the variable, x, we have 


z = In 


x + 1 + \/ x 2 + 2a; — 3 


+ C. 


2. Evaluate the definite integral 



da; 

a; 2 — 6a; + 25’ 


2 



Solution 

Completing the square in the quadratic expression gives 

x 2 -6x + 25 = (x- 3) 2 + 16. 


Hence, 



da; 

(x — 3) 2 + 16’ 


Putting u = x — 3, we obtain ^ 
Thus, 


= 1; and so ^ = 1. 



d u 

u 2 + 16’ 


giving 


z = 


1 _ i u 

-tan — 
4 4 


JO 


7 r 

16' 


Alternatively, without changing the original limits of integration, 


z = 


1 _,x 

-tan — 
4 


-I' 

- 3 


Note: 

In cases like the two examples discussed above, when ^ = 1 and therefore 4^ = 1, it seems 
pointless to go through the laborious process of actually making the substitution in detail. 
All we need to do is to treat the linear expression within the completed square as if it were 
a single x, then write the result straight down ! 

12.3.2 EXERCISES 

1. Use a table of standard integrals to write down the indefinite integrals of the following 
functions: 

(a) 

1 

\/4 — x 2 1 

(b) 

1 

9 + x 2 ' 


3 



1 


\J x 2 - 7' 

2. By completing the square, evaluate the following definite integrals: 
(a) 


(b) 


l-i x 2 + 2x + 2’ 


r 1 da; 
lo V-l — 2 a; — 


x- 


12.3.3 ANSWERS TO EXERCISES 


1. (a) 


■ -i x 

sm — 

2 


+ C; 


(b) 


(c) 


1 ,i 

-tan — 
3 3 


+ C; 


ln(x + Vx 2 — 7) + C. 


2* (a) 


(b) 


tan 1 (a; + 1) 


] a/3—1 
. -l 


7T 

3 ’ 



x + 1] 1 


7T 7T 
2 ~ 6 


71 

3' 


4 
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UNIT 12.4 - INTEGRATION 4 

INTEGRATION BY SUBSTITUTION IN GENERAL 
12.4.1 EXAMPLES USING THE STANDARD FORMULA 

With any integral 

J f(x)dx, 

it may be convenient to make some kind of substitution relating the variable, x, to a new 
variable, u. In such cases, we may use the formula discussed in Unit 12.1, namely 

J f(x) dx = J f(x)^du, 

where it is assumed that, on the right hand side, the integrand has been expressed wholly 
in terms of u. 

For this Unit, substitutions other than linear ones will be given in the problems to be solved. 

EXAMPLES 


1. Use the substitution x 


a sin u to show that 
f da; 


V~o 


= sm 


-i 


x 


x z 


+ C. 


Solution 

To be precise, we shall assume for simplicity that u is the acute angle for which x = 
a sin u. In effect, we shall be making the substitution u = sin” 1 ^ using the principal 
value of the inverse function; we can certainly do this because the expression \J a 2 — x 2 
requires that —a < x < a. 

If x = a sin-w, then ^ = acosu, so that the integral becomes 


acosu 


V a 2 — a 2 sin 2 u 


-du. 


But, from trigonometric identities, 



— a 2 sin 2 u = acosu, 


both sides being positive when u is an acute angle. 


We are thus left with 



u + C = sm 1 — 1 -C. 

a 


1 



2. Use the substitution u 


-j: to determine the indefinite integral 
f da; 


z = 


x 


y/1 + x 2 


Solution 

Converting the substitution to the form 


x = 


1 

u 


we have 


Hence, 


That is, 


da; 1 

d u u 2 



-^du 

u z 


z = [ 1 = - In (u + a/m 2 + 1) + C. 

J Vu 2 + 1 


Returning to the original variable, x, we have 


z — — In 



+ C. 


Note: 

This example is somewhat harder than would be expected under examination condi- 
tions. 

12.4.2 INTEGRALS INVOLVING A FUNCTION AND ITS DERIVATIVE 

The method of integration by substitution provides two useful results applicable to a wide 
range of problems. They are as follows: 

/ {f(x)] n f'(x)dx = //'/ + C 

provided n / -1. 

(b) 

I W) ix = >nf(x - +C ‘ 


2 



These two results are readily established by means of the substitution 

u = f(x). 

In both cases ^ = f'(x ) and hence ^ This converts the integrals, respectively, into 


(a) 

and (b) 



u n+1 
n + 1 


+ C 



In u + C. 


EXAMPLES 


1. Evaluate the definite integral 



sin 3 x. cos a; da;. 


Solution 

In this example we can consider sin a; to be f(x) and cos a; to be f'(x). 
Thus, by quoting result (a), we obtain 


7 r 
3 


sin 3 a;. cos a;da; 


sin 4 a; 


J o 


9 

64’ 


using sin | = 

2. Integrate the function 


2a; + 1 
x 2 + x — 11 


with respect to x. 

Solution 

Here, we can identify x 2 + x — 11 with f(x) and 2a; + 1 with f'(x). 
Thus, by quoting result (b), we obtain 


/ „ — ^ — da; = ln(a; 2 + x — 11) + C. 

J x 2 + x — 11 


3 



12.4.3 EXERCISES 


1. 


2 . 


3. 


Use the substitution u — x + 3 in order to determine the indefinite integral 

J x\/3 + x (lx. 

Use the substitution u = x 2 — 1 in order to evaluate the definite integral 

[ xVx 2 — 1 dx. 


J i 

Integrate the following functions with respect to x: 


faj 

sin' a;, cos a:; 

(b) 

cos 5 a;. sin a:; 

(c) 

4a: — 3 

2a; 2 - 3a; + 13 

(d) 

cot X. 


12.4.4 ANSWERS TO EXERCISES 


1 . 


2 . 



2(x + 3)i +C. 



1 3 

-242 ~ 39.192 
3 


4 



3. (a) 


sin 8 a; 

8 


+ C; 


(b) 


(c) 

(d) 


cos 6 a; 

6 


+ C; 


ln(2a; 2 — 3a: + 13) + C; 


In sin x + C. 


5 
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UNIT 12.5 - INTEGRATION 5 


INTEGRATION BY PARTS 
12.5.1 THE STANDARD FORMULA 

The technique to be discussed here provides a convenient method for integrating the product 
of two functions. However, it is possible to develop a suitable formula by considering, instead, 
the derivative of the product of two functions. 


We consider, first, the following comparison: 


-r- [x sin x] = x cos x + sin x 


x cos x = ^ [x sin x] — sin x 

uT = 

ax ax'- -i ax 

J x cos x dx = x sin x — f sin x da; 

J uf- da; = uv - J da; 

J ax J ax 

= x sin x + cos x + C 



We see that, by labelling the product of two given functions as u 'r , we may express the 
integral of this product in terms of another integral which, it is anticipated, will be simpler 
than the original. 

To summarise, the formula for “integration by parts” is 

f dv f dn 

/ u — — da; — uv — / v— da;, 
a da; a da; 


EXAMPLES 


1. Determine 

I — j x 2 e 3x da;. 

Solution 

In theory, it does not matter which element of the product x 2 e 3x is labelled as u and 
which is labelled as but the integral obtained on the right- hand-side of the integration 
by parts formula must be simpler than the original. 

In this case we shall take 

9 . du 3 _ 

u = a> and — — e . 
dx 


1 




Hence, 


/* p 3x 

I = x 2 -— / — .2x dx. 

3 J 3 

That is, 


I 





xe 3 ' T dx. 


The integral on the right-hand-side still contains the product of two functions and so 
we must use integration by parts a second time, setting 


u = x and 


dx 

dx 


= e 


3x 


Thus, 


1 2 

I = V e 3* _ f 

3 3 


^3x n ^3x 


x / — .1 dx 

3 2 3 


The integration may now be completed to obtain 

I = 1 x 2 e 3x - -xe 3x + ^e 3x + C, 

3 9 27 

or 


0 3a; 


7 27 L 


9x 2 — 6x + 2 + C. 


2. Determine 


/ = / x lnx dx. 


Solution 

In this case, we cannot effectively choose 4^ = lnx since we would need to know the 
integral of lnx in order to find v. Hence, we choose 


u — lnx and 


dx 

dx 


x, 


obtaining 


2 



That is, 


I = (lnx)y 



x 2 

y 


dx. 

X 


giving 


3. Determine 



x 

2 


dx, 


/ = 


1 

-x 2 In x b C. 

2 4 


/ = 


/ 


In x dx. 


Solution 

It is possible to regard this as an integration by parts problem if we set 


We obtain 


u = In x and 


dv 

da; 


1 . 


/ = x In x 



giving 


/ = x In x — x + C. 


4. Evaluate 



sin l x da;. 


Solution 

In a similar way to the previous example, it is possible to regard this as an integration 
by parts problem if we set 


u = sin l x and 


du 

da; 


1 . 


3 



We obtain 


/ = hsin x 


-i~U 


x. 


/o a/1 — X 1 


da;. 


That is, 


/ = [a;sin _1 a; + \/l — a; 2 ]o = — — 1. 


5. Determine 


1=1 e 2x cosx clx. 


Solution 

In this example, it makes little difference whether we choose e 2x or cos a; to be u: but 
we shall set 


dx> 


u = e and — = cos a:. 

da; 


Hence, 


I = e 2x sin x — I ('sina;).2e^ da;. 


2x 


That is, 


/ = e 2x sin x — 2 / e^ x sin x da; 


2x , 


Now we need to integrate by parts again, setting 


dn 


u = e and — = sin x. 
ax 


Therefore, 


I = e 2x sin x - 2 


—e 2x cos x — J (— cosa;).2e 23: da; 


In other words, the original integral has appeared again on the right hand side to give 


/ = e 2x sin x — 2 —e 2x cos x + 21 1 . 


4 



On simplification, 


5 1 = e 2x sin x + 2e 2x cos x, 


so that 


I 



sin x + 2 cos x] + C. 


Note: 

The above examples suggest a priority order for choosing u in a typical integration by parts 
problem. For example, if the product to be integrated contains a logarithm or an inverse 
function, then we must choose the logarithm or the inverse function as u; but if there are 
powers of x without logarithms or inverse functions, then we choose the power of x to be u. 

The order of priorities is as follows: 

1. LOGARITHMS or INVERSE FUNCTIONS; 

2. POWERS OF x] 

3. POWERS OF e. 

12.5.2 EXERCISES 

integration by parts to evaluate the definite integral 

f 1 x 3 e 2x dx. 

Jo 

integration by parts to integrate the following functions with respect to x: 

x 2 cos 2x; 
x 5 In x; 
tan~ 1 x; 


1. Use 


2. Use 
fa) 


(b) 


(c) 


5 



(d) 


a; tan l x. 


3. Use integration by parts to evaluate the definite integral 



e 2x sin 3a; da;. 


12.5.3 ANSWERS TO EXERCISES 


1. 


2. (a) 


(b) 


(c) 


(d) 


3. 



- i 

6a; 2 + 6a; — 3) 

. o 



~ 1.299 


1 

4 


[2a; 2 


sin 2x + 2x cos 2x 


sin 2a;] + C; 


|g[6 lnx-1 } + C; 


xtan lx ~ 2 bi(l + x 2 ) + C; 


1 

2 


a; 2 tan l x — x + tan l x] + C. 


e 


—2x 


13 


(3 cos 3a; 


2 sin 3a;) 

. o 


— y^(e _27r + i) ~ -0.231 

-Lo 


6 
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UNIT 12.6 - INTEGRATION 6 
INTEGRATION BY PARTIAL FRACTIONS 


12.6.1 INTRODUCTION AND ILLUSTRATIONS 

If the ratio of two polynomials, whose denominator has been factorised, is expressed as a sum 
of partial fractions, each partial fraction will be of a type whose integral can be determined 
by the methods of preceeding sections of this chapter. 

The following summary of results will cover most elementary problems involving partial 
fractions: 

RESULTS 


1 . 


2 . 


3. 


4. 


or 


and 


5. 


ax 


da; 


- In (ax + b) + C. 
a 


1 d * = i + C provided n ? 1. 


(ax + b ) 


a —n + 1 


1 i 1, _i£ 

dx = -tan — ho. 


a 2 + x 2 


a a 


a 2 — x 2 


dx = -tanh 1 — + C, 
a a 


— In 
2 a 


fa + x\ , . . 

+ G when x < a, 

\a — xj 


In ( 1 + - ) + C when Ixl > a. 


2 a \x — a 


2 ax + 6 
ax 2 + bx + c 


dx = ln(ax 2 + bx + c) + C. 


1 



ILLUSTRATIONS 


We use some of the results of examples on partial fractions in Unit 1.8 

1 . 


7x + 8 


(2a; + 3)(a; — 1) 


da; = 


1 3 

+ 


1.2a; + 3 x — 1 


da; 


2 . 


= - ln(2a; + 3) + 3 ln(x — 1) + C. 


r 8 3x 2 + 9 r 8 

k (x - 5)(x 2 + 2x + 7) C ^ = Je 


2 x + 1 

x — 5 x 2 + 2x + 7 \ 


da; 


X 

2 ln(x — 5) + - ln(a; 2 + 2a; + 7) 

2 J 6 


~ 2.427 


[ 9 - 1 

-1 

3 1 

1 

! (x + l) 2 (a; — 2) J 

x + 1 

(x + l) 2 ' x — 2 


da; 


= — ln(x + 1) H 1- ln(x — 2) + C. 

x-\-l 


Ax 2 + x + 6 


(x — 4) (a; 2 + 4a; + 5) 


da; = 


2 2a; + 1 


x — 4 x 2 + 4a; + 5j 


da; 


= 2 ln(x — 4) + ln(ar + 4a; + 5) — 3tan 1 (x + 2 ) + C. 


Note: 

In the last example above, the second partial fraction has a numerator of 2a; + 1 which is 
not the derivative of x 2 + 4a; + 5. But we simply rearrange the numerator as (2a; + 4) — 3 
to give a third integral which requires the technique of completing the square (discussed in 
Unit 12.3). 


2 



12.6.2 EXERCISES 


Integrate the following functions with respect to x: 


1. (a) 


(b) 


(c) 


(d) 


(e) 


(f) 


3a; + 5 

(x + l)(a; + 2) ’ 


17a; + 11 

(x + l)(x — 2) (a; + 3) ’ 


3a; 2 -8 

(x — l)(x 2 + x — 7) ’ 


2a; + 1 

(x + 2) 2 (a; — 3) ’ 


9 + 11a; — x 2 
(x + l) 2 (a; + 2) ’ 


(x + 2) (a; — 4) ’ 


2. Evaluate the following de fini te integrals 


(a) 


(b) 



7x 2 + 11a; + 47 
(x — l)(a; 2 + 2a; + 10) 


da; 



4a; 2 + 1 

n x 

x(2x — l) 2 


3 



12.6.3 ANSWERS TO EXERCISES 

1. (a) 


(b) 

(c) 


2 ln(x + 1) + ln(x + 2) + C; 
ln(x + 1) + 3 ln(x — 2) — 4 In (a; + 3) + C\ 
ln(x — 1) + ln(x 2 + x — 7) + C\ 


(d) 

(e) 


(f) 


3 

5(x + 2) 


^ In (a; + 2) + ^ ln(x - 3) + C\ 


3 16 17 


3 

a: + 1 


(a: + l) 2 x + 1 a; + 2 
+ 16 In (a; + 1) — 17 ln(a: + 2) + C; 


r 4 2r 3 16 ^12 

— + — + 6a; 2 + 40a; + — In (a; + 2) + — ln(ar - 4) + C. 


2. (a) 


(b) 


5 ln(a; — 1) + ln(ar + 2a; + 10) + -tan 


i -i^ + r 


~ -2.726 


J 2 


n 3 


In a; 


2x- 1 


~ 2.699 


J i 


4 
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UNIT 12.7 - INTEGRATION 7 - FURTHER TRIGONOMETRIC FUNCTIONS 
12.7.1 PRODUCTS OF SINES AND COSINES 

In order to integrate the product of a sine and a cosine, or two cosines, or two sines, we may 
use one of the following trigonometric identities: 


sinAcosB = - [sin(A + B) + sin(A — B)] ; 

cosAsinB = - [sin(A + B) — sin(A — B)] ; 
cosAcosB = - [cos(A + B) + cos(A — B)] ; 
sinAsinB = ^ [cos (A — B) — cos (A + B)] . 


EXAMPLES 

1. Determine the indefinite integral 


sin 2x cos 5a; da;. 


Solution 


sin 2a; cos 5a; da; 



sin 3a;] da; 


cos 7x 
14 


cos 3a; 
6 


+ C. 


2. Determine the indefinite integral 

j sin 3a; sin x da;. 


Solution 


sin 3a; sin x da; 



cos 4a;] da; 


sin 2a; 
4 


sin 4a; 
8 


+ C. 


1 



12.7.2 POWERS OF SINES AND COSINES 

In this section, we consider the two integrals, 

J sm n xdx and J cos" da;, 


where n is a positive integer. 

(a) The Complex Number Method 

A single method which will cover both of the above integrals requires us to use the methods 
of Unit 6.5 in order to express cos"a; and sin n x as a sum of whole multiples of sines or cosines 
of whole multiples of x. 

EXAMPLE 

Determine the indefinite integral 

[ sin 4 x da;. 


Solution 

By the complex number method, 


sin 4 a; = -[cos 4a; 


4 cos 2a; + 3] . 


The Working: 


■4o4 • 4 

j 2 sm x = 




4 


5 


where z = cos x + j sin x. 
That is, 


16sin 4 a; = z 4 — 4 z 3 . 


1 


z 





4 


J 


or, after cancelling common factors, 


2 



which gives 


16sin 4 x = z 4 - Az 2 + 6 - \ = z 4 + \ - A (z 2 + \ ) + 6, 

z 2 z A z 4 V z 2 


or 


Hence, 


IGsinx = 2 cos 4a; — 8 cos 2x + 6, 


sin 4 a; = - [cos 4a; — 4 cos 2a; + 3] . 


/' • 4 i 1 

'sin 4a; sin 2a; 

/ sm x ax = - 

— ; 4 — h 3a; 

J 8 

.4 2 


+ C 


= [sin 4a; — 8 sin 2a; + 12a;] + C. 

O Zi 

(b) Odd Powers of Sines and Cosines 

The following method uses the facts that 

f [sin x] = cos x and f [cos x] = sin x. 
ax ax 

We illustrate with examples in which use is made of the trigonometric identity 

cos 2 A + sin 2 A = 1. 


EXAMPLES 

1. Determine the indefinite integral 

J sin 3 .x' da;. 


3 



Solution 


That is, 


sin 3 x dx = / sin 2 x. sinx dx. 


J sin 3 xdx = j (l — cos 2 x) sinx dx 
= j (sinx — cos 2 x. sinx) dx 


cos 3 x 


= — cos x H b C. 

o 


2. Determine the indefinite integral 


J cos 7 x dx. 


Solution 


That is, 


cos 7 x dx = / cos 6 x.cosx dx. 


J cos 7 x dx = J (l — sin 2 x) . cosx dx 
J (l — 3sin 2 x + 3sin 4 x — sin 6 x) . cosx dx 


= sinx — sin 3 x + 3.- 


sm“x sin 7 x 


+ C. 


(c) Even Powers of Sines and Cosines 

The method illustrated here becomes tedious if the even power is higher than 4. In such 
cases, it is best to use the complex number method in paragraph (a) above. 

In the examples which follow, we shall need the trigonometric identity 

cos 2x1 = 1 — 2sin 2 A = 2cos 2 A — 1. 


4 



EXAMPLES 


1. Determine the indefinite integral 


sin 2 a; da;. 


Solution 


f . o , f 1/^ ^ N , x sin 2x 

J sm x ax = J -(1 — cos lx) da; = — — f- C. 


2. Evaluate the definite integral 


cos 2 a; da;. 


Solution 


7 r 7T - 

/ cos 2 a; dx= -(1 + cos 2a;) da; 
/o ao 2 


a; sin 2a; 
2 + ~ 


2 7T 

J 0 4 


3. Determine the indefinite integral 


cos 4 a; da;. 


Solution 


That is, 


cos 4 a; da; = 


cos 2 a; 


1 2 


da; = 


n2 


-(1 + cos 2a;) 


da;. 


J cos 4 a; d x = j - (l + 2 cos 2a; + cos 2 2a;) da; 
= J ^ ^1 + 2 cos 2a; + ^[1 + cos 4a;] ^ da; 


a; sin 2a; x sin 4a; ^ 

+ - + — r^ + C 1 


4 4 


32 


3a; sin 2a; sin 4a; 
~8~ + 4 32 


C. 


5 



12.7.3 EXERCISES 


1. Determine the indefinite integral 


J cos x cos 3a: dx. 


2. Evaluate the definite integral 


7 r 

3 


cos 4x sin 2x dx. 


3. Determine the following indefinite integrals: 

(a) 

J sin 5 .x' dx; 

(b) 

J cos 3 x dx. 


4. Evaluate the following definite integrals: 
fa) 


(b) 


sin 4 x dx; 


cos 6 x dx. 


12.7.4 ANSWERS TO EXERCISES 


1 . 


2 . 


sin 4x 
8 


sin 2x 
— + a 


V3 

4 




-0.433 


6 



3. (a) 


cos 5 a; 

5 


+ 2 


cos 3 a; 

3 


— cos x + C, 


or 


1 [ cos 5a; 
16 [ 5 


+ 


5 cos 3a; 
3 


10 cos a; 


+ C 


by complex numbers; 


(b) 


sin x — 


sin 3 x 
3 


+ c, 


or 


1 

4 


sin 3a; 
3 


3 sin x 


+ C 


by complex numbers. 


4. (a) 

(b) 


1.735 x 10 3 approx; 


- 1 . 


7 
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UNIT 12.8 - INTEGRATION 8 


THE TANGENT SUBSTITUTIONS 

There are two types of integral, involving sines and cosines, which require a special substi- 
tution using a tangent function. They are described as follows: 

12.8.1 THE SUBSTITUTION t = tana: 

This substitution is used for integrals of the form 


a + &sin 2 a; + ccos 2 x 


da;, 


where a, b and c are constants; though, in most exercises, at least one of these three constants 
will be zero. 

A simple right-angled triangle will show that, if t — tana;, then 


Sill x = 


VT+P 


and cos x = 


VTTP' 


Furthermore, 


EXAMPLES 



1. Determine the indefinite integral 


4 — 3sin a: 


da;. 


1 



Solution 


1 

4 — 3sin 2 x 


da; 


4-^‘i + t 2 


dt 


4 + 1 2 


dt 


= -tan 1 — f C — -tan 1 
2 2 2 


tana;' 


2. Determine the indefinite integral 


Solution 


sin 2 a; + 9cos 2 a; 


da;. 


sin 2 a; + 9cos 2 a; 


da; 


t 2 i _9_ ' I +2 
1+t 2 + 1 +t 2 ^ 


dt 


1 


t 2 + 9 


dt 


It X 

-tan -1 — K C — -tan -1 
3 3 3 


tana;' 


12.8.2 THE SUBSTITUTION t = tan(x/2) 
This substitution is used for integrals of the form 


a + b sin x + c cos x 


da;, 


+ C. 


+ C. 


2 



where a, b and c are constants; though, in most exercises, one or more of these constants 
will be zero. 


In order to make the substitution, we make the following observations: 


sin a; = 2 sin(x/2). cos(a;/2) = 2 tan(a;/2).cos 2 (a;/2) 


2 tan(a;/2) 
sec 2 (a;/2) 


2 tan(a;/2) 

1 + tan 2 (x/2) 


Hence, 


sin x = 


2 1 

l + t 2 ' 


(ii) 


cosa: = cos 2 (a:/2)— sin 2 (a:/2) = cos 2 (a;/2) 1 — tan 2 (a;/2) 


1 — tan 2 (a;/2) _ 1 — tan 2 (a;/2) 
sec 2 (x/2) 1 + tan 2 (a;/2) ’ 


Hence, 


cos a; = 


1 -t 2 
l + t 2 ' 


(iii) 


df 

da; 


1 

2 


sec 2 (a;/2) 



+ tan 2 (a;/2) 


1 

2 


[l + t 2 ]. 


Hence, 


da; _ 2 

d t l+t 2 

EXAMPLES 


1. Determine the indefinite integral 


1 


1 + sin x 


da; 


3 



Solution 


1 


da; 


1 + sin x 
1 2 


l + T&'l + * 2 


dt 


1 + t 2 + 2t 
r 2 


dt 


(l + *) s 


dt 


1 + 1 

2. Determine the indefinite integral 

/ 

Solution 


+ C = 


1 + tan(a;/2) 


+ C. 


4 cos x — 3 sin x 
1 

4 cos x — 3 sin x 
1 2 


da;. 


da; 


1 -t 2 6 1 


4 — 4t 2 — 6 1 


: l+t 2 l+i 2 


dt = 


'1 + t 2 


dt 


J 

(2t — l)(t + 2) 

[ 1 

■ 1 2 ' 

/ 5 

.t + 2 2t — 1. 


2 1 2 + 3t - 2 
dt 


dt 


-[ln(t + 2) — ln(2t — 1)] + C — - In 
5 5 


dt 

tan(x/2) + 2 
2 tan(a;/2) — 1 


+ C. 


4 



12.8.3 EXERCISES 


1. Determine the indefinite integral 


4 + 12cos 2 a; 


da;. 


2. Evaluate the definite integral 


/ o 5cos 2 a; + 3sin 2 a; 


da;. 


3. Determine the indefinite integral 


1 


5 + 3 cos x 


da;. 


4. Evaluate the definite integral 



1 

12 sin a; + 5 cos a; 


da;. 


12.8.4 ANSWERS TO EXERCISES 


1. 


2 . 


3. 


4. 


X 1 

-tan 

4 


tana; 

2 


+ C. 


, — tan 
a/15 


-l 



X _1 

-tan 

2 


tan(x/2) 

2 


+ C. 


r i i 3 - 1 

— [51n(5tan(a;/2) + 1) — ln(tan(x/2) — 5) 

.13 J3 


0.348 


5 
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UNIT 12.9 - INTEGRATION 9 


REDUCTION FORMULAE 
INTRODUCTION 

For certain integrals, both definite and indefinite, the function being integrated (that is, the 
“integrand” ) consists of a product of two functions, one of which involves an unspecified inte- 
ger, say n. Using the method of integration by parts, it is sometimes possible to express such 
an integral in terms of a similar integral where n has been replaced by (n — 1), or sometimes 
(n — 2). The relationship between the two integrals is called a “reduction formula” and, 
by repeated application of this formula, the original integral may be determined in terms of 
n. 

12.9.1 INDEFINITE INTEGRALS 

The method will be illustrated by examples. 

EXAMPLES 

1. Obtain a reduction formula for the indefinite integral 

I n — J x n e x da; 


and, hence, determine J 3 . 

Solution 

Using integration by parts with u = x n and 4^ = e x , we obtain 

I„ = x n e x — j e x .nx n ~ 1 dx. 


That is, 


Substituting n — 3, 


where 


In = X n e x - nl n -1 
h = x z e x - 3/ 2 , 
I 2 = x 2 e x - 2/i 


1 



and 


h = xe x - J 0 


But 


/ 0 = j e x dx = e x + constant, 


which leads us to the conclusion that 

\x 2 e x - 2 (xe x - e x ) 


h = x 3 e x - 3 


+ constant. 


In other words, 


h = e x 


x 6 — 3x + 6x — 6 +C, 


where C is an arbitrary constant. 

2. Obtain a reduction formula for the indefinite integral 

/„ = [ x n cos x dx 


and, hence, determine I 2 and I3. 

Solution 

Using integration by parts with u = x n and = cosa:, we obtain 


I n = x n sin x — j sin x.nx n ~ dx = x n sin x — n J x n ~ sin x da;. 

Using integration by parts in this last integral, with u = a; n_1 and ^ = sin a;, we obtain 
I n = a;” sin a; — n |— x n ~ l cosa; + J cos x.(n — l)x n ~ 2 da;| . 

That is, 


I n = x n sin a; + nx n 1 cosa; — n(n — l)/ n _ 2 . 


Substituting n — 2, 


J 2 = x 2 sin x + 2x cos x — 2/ 0 , 


2 



where 


/o = 


/ 


cos a; da; 


sin a; + constant. 


Hence, 


J 2 = x 2 sin x + 2x cos x — 2 sin x + C, 

where C is an arbitrary constant. 

Also, substituting n = 3, 


J 3 = x 3 sin a; — 3a; 2 cos a; — 3. 2. A, 


where 

1 1 — j x cos x dx = x sin x + cos x + constant. 

Therefore, 

J 3 = x 3 sin x — 3a; 2 cos x — 6a; sin x — 6 cos x + D, 
where D is an arbitrary constant. 

12.9.2 DEFINITE INTEGRALS 

Integrals of the type encountered in the previous section may also include upper and lower 
limits of integration. The process of hireling a reduction formula is virtually the same, except 
that the limits of integration are inserted where appropriate. Again, the method is illustrated 
by examples. 

EXAMPLES 

1. Obtain a reduction formula for the definite integral 

I n = f 1 x n e x da; 

Jo 


and, hence, determine / 3 . 


3 



Solution 

From the first example in section 12.9.1, 

In [*£ 6 ]g Ylln—l ^ 

Substituting n — 3, 


where 


h — e ~ 3 / 2 , 


and 


I 2 — e — 2/i 
I\ — e — Jq. 


But 


J 0 = / e x dx = e — 1, 
Jo 

which leads us to the conclusion that 


J 3 = e — 3e + 6e — 6e + 6 = 6 — 2e. 


2. Obtain a reduction formula for the definite integral 


I n = x n cos x dx 
Jo 


and, hence, determine J 2 and I3. 

Solution 

From the second example in section 12.9.1, 


In = 


x n sin x + nx n 1 cos x 


n(n - l)/ n _ 2 


Substituting n — 2, 


-mr n 1 - n(n - l)I n - 2 - 


I 2 — — 27T — 2/o, 

where 

riv 

Iq= cosx dx = [sinxln = 0. 
Jo 


4 



Hence, 


h = -2vr. 


Also, substituting n = 3, 


h = -3vr 2 - 3.2./!, 


where 


P7V 

Ii = xcosx da; = [xsina; + cosxln = — 2. 
Jo 


Therefore, 


h = -3vr 2 + 12. 


12.9.3 EXERCISES 

1. Obtain a reduction formula for 


I n = I x n e 2x da; 

when n > 1 and, hence, determine / 3 . 

2. Obtain a reduction formula for 


I n = / x n e 2x dx 

Jo 

when n > 1 and, hence, evaluate 1 4. 

3. Obtain a reduction formula for 


In 


x n sin x da; 


when n > 1 and, hence, determine 14. 

4. Obtain a reduction formula for 


P7T 

I n = / aAsina; da; 
Jo 

when n > 1 and, hence, evaluate / 3 . 


5 



5. If 


In 



(lna;) n dx, 


where n > 1, show that 


I n = a;(lna;) n - n/ n _i 


and, hence, determine J3. 
6. If 


In 




da;, 


show that 


In = 


2n + 1 L 

Hint: Write ( x 2 + a 2 )' 1 as Ida; 2 + a 2 ) n 


x(x 2 + a 2 ^j + 2 na 


12.9.4 ANSWERS TO EXERCISES 


In 2 L' 


x n e 2x — n/„_i 


giving 


/ 3 



4a; 3 — 6a; 2 + 6a; — 3 


2 . 


giving 




In— 1 • 


+ c. 


6 



3. 


I n = — x n cos x + nx 11 1 sin x — n(n — 1 )/ n _ 2 , 


giving 


/ 4 = —x 4 cos x + 4a; 3 sin x + 12x 2 cos x — 24a; sin x — 24 cos a; + C. 


4. 


J n = 7r n - n(n - l)/ n _ 2 , 


giving 


J 3 = 7T 3 — 67T. 


5. 


/.3 


a; 


In a;) 


3(lna;) 2 + 6 In x — 6 


+ C. 


7 
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UNIT 12.10 - INTEGRATION 10 


FURTHER REDUCTION FORMULAE 


INTRODUCTION 

As an extension to the idea of reduction formulae, there are two particular definite integrals 
which are worthy of special consideration. They are 


sin n x da; and / cos"a; da;. 


But, first, we shall establish the reduction formulae for the equivalent indefinite integrals. 

12.10.1 INTEGER POWERS OF A SINE 

Suppose that 


I n = / sin"a; da;; 


then, by writing the integrand as the product of two functions, we have 


I„ = / sin n x'sinx da;. 


Using integration by parts, with u = sin" l x and 4^ = sin a;, we obtain 


I n = sin" x a;(— cos a;) + J (n — l)sin" 2 a;cos 2 a; da;. 


But, since cos 2 a; = 1 — sin 2 a;, this becomes 


I n = -sin" x a; cos x + (n - 1) [J n _ 2 - I n . ■ 


Thus, 


I n = — —sin" x a;cosa; + in — l )/ n -2 • 

71 


1 



EXAMPLE 


Determine the indefinite integral 


Solution 


sin 6 a; dx. 


where 


h = 


1 r 
6 - 


— sin 5 :r cos a: + 5/4 


1 r 

4 - 


I 4 = - — sin 3 a; cos x + 3/ 2 , / 2 = - [— sin x cos x + J 0 j 


1 

2 


and 


Hence, 


Jo = / da; = x + constant. 


J 2 = - [— sin x cos x + x + constant] ; 


h = 


, 3 3 

-sin a; cos x — - sin a; cos x + -a; + constant 


h = 


6 L 


. = 5.0 15 . 15 

-sm a; cos x sin a: cos x sin x cos x H x + constant 

4 8 8. 


/' . a 1. 5 5o 5 . 5a; 

1 hus, / sm x ax = — sm a; cos x — |'--sini' cos x sm x cos x H ho, 

’7 6 24 16 16 


where C is an arbitrary constant. 


2 



12.10.2 INTEGER POWERS OF A COSINE 

Suppose that 


I n = j cos"a; da;; 


then, by writing the integrand as the product of two functions, we have 


I n = / COS" l X COS X dx 


Using integration by parts, with u = cos" l x and 4 ^ = cos a;, we obtain 


I n = cos" ^sina; + J (n — l)cos" 2 a:sin 2 a; da;. 


But, since sin 2 a; = 1 — cos 2 a;, this becomes 


I n = cos" a; sin a; + (n - l)[/„_ 2 - I n ■ 


Thus, 


In = ~ 

n L 


cos" 1 a;sina; + (n — l)/ n _2 


EXAMPLE 

Determine the indefinite integral 


/ 


cos 5 a; da;. 


Solution 


h — 



x sin x + 4/3 


5 


3 



where 


1 r o 1 

1 ^ = — cos x sin x + 2 /i 


= / cos a; da; = sin a; + constant. 


Hence, 


1 r 2 i 

J 3 = — cos x sin x + 2 sin x + constant ; 


r If 4 . 4 , 8 . 1 

15 — - cos x sm x H — cos x sm x H — sm x + constant ; 
5 L 3 3 J 


We conclude that 


f 5 , 1 4 . 4 2 8 . 

/ cos x ax = -cos x sm x H cos a; sm x H sina; + G, 

J 5 15 15 


where C is an arbitrary constant. 

12.10.3 WALLIS’S FORMULAE 

Here, we consider the definite integrals 


sin n a; da; and / 2 cos n a; da;. 


Denoting either of these integrals by J n , the reduction formula reduces to 


j — H 1 T 

1 n In — 2 

n 


in both cases, from the previous two sections. 


4 



Convenient results may be obtained from this formula according as n is an odd number or 
an even number, as follows: 

(a) n is an odd number 

Repeated application of the reduction formula gives 

n — In — 3 n — 5 642^ 

n ~ n ’ n — 2 ’ n — 4 ’ ' ' ' 7'5'3 1- 


But 

7T 7T 

Ii — / 2 sin x da; or lx = / cos a; da;, 
ao ao 

both of which have a value of 1. 

Therefore, 

_ (n - l)(n - 3)(n - 5). . . . 6.4.2 
" “ n(n — 2)(n — 4). . . . 7.5.3 ’ 


which is the first of “Wallis’s formulae” . 

(b) n is an even number 

This time, repeated application of the reduction formula gives 

n — 1 n — 3 n — 5 5 3 1 ^ 

n ~ n n — 2 n — A' ' ' ' 6'4'2 °‘ 


But 


/o = 



da; = 


7T 

2 ' 


Therefore, 


_ (n - l)(n - 3)(n - 5). . . . 5.3.1 7 r 
n “ n(n — 2)(n — 4). . . . 6.4.2 2’ 


5 



which is the second of “Wallis’s formulae”. 


EXAMPLES 


1. Evaluate the definite integral 


Solution 



sin 5 x dx. 


. 5 , 4.2 

sm x ax = —— 
5.3 


8 

15' 


2. Evaluate the definite integral 


Solution 



cos 4 x dx. 


4 3.17 r 

cos x ax = 

4.2 2 


37T 

16' 


12.10.4 COMBINATIONS OF SINES AND COSINES 

Another type of problem to which Wallis’s formulae may be applied is of the form 


7 r 
2 


sin m xcos n x dx, 


where either m or n (or both) is an even number. We simply use sin 2 x = 1 — cos 2 x or 
cos 2 x = 1 — sin 2 x in order to convert the problem to several integrals of the types already 
discussed. 

EXAMPLE 


Evaluate the definite integral 


Solution 



cos 5 xsin 2 x dx. 


cos 5 xsin 2 x dx = 


cos 5 x 


(l — cos 2 x) 


dx = 


( 5 7 

, cos x — COS X 


dx, 


6 



which may be interpreted as 


4.2 5.4.3 8 16 _ 8 

545 ~ 6A2 “ 15 ~~ 35 “ H)5‘ 


12.10.5 EXERCISES 


1. Determine the indefinite integral 

J cos 4 a; dx. 

2. Determine the indefinite integral 


/ 


sin'x d.x\ 


3. Evaluate the definite integral 


4. Evaluate the definite integral 


5. Evaluate the definite integral 


6. Evaluate the definite integral 



sin 6 x Ax. 



cos 9 x Ax. 



cos 2 xsin 6 x dx. 


7 r 
2 


sin 3 a;cos 4 a: dx. 


7 



12.10.6 ANSWERS TO EXERCISES 


1 . 

1 3 3a: 

-cos 3 a: sin x H — cos x sin x H I - C. 

4 8 8 

2 . 

1.6 6. 4 24 . 2 16 

— -sm x cos x sin x cos x sin x cos x cos x + C. 

7 35 105 35 

3. 

57T 

32' 

4. 

128 

315 

5. 

57T 

32' 

6 . 

4 

_ 105 
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UNIT 13.1 - INTEGRATION APPLICATIONS 1 


THE AREA UNDER A CURVE 
13.1.1 THE ELEMENTARY FORMULA 

We shall consider, here, a method of calculating the area contained between the a;-axis of a 
cartesian co-ordinate system and the arc, from x = a to x = b, of the curve whose equation 

is 


V = f(x). 


Suppose that A(x) represents the area contained between the curve, the rr-axis, the y - axis 
and the ordinate at some arbitrary value of x. 



A small increase of 5x in x will lead to a corresponding increase of 5 A in A approximating 
in area to that of a narrow rectangle whose width is 5x and whose height is f{x). 

Thus, 


5A ~ f(x)5x, 


which may be written 


SA 

5x 




/(*)• 


By allowing 5x to tend to zero, the approximation disappears to give 


1 



dA 

da; 


= f{x). 


Hence, on integrating both sides with respect to x, 


A(x) = I f(x) dx. 


The constant of integration would need to be such that A — 0 when x = 0; but, in fact, we 
do not need to know the value of this constant because the required area, from x = a to 
x — b, is given by 


A{b) — A{a) = f f(x) dx. 
J a 


EXAMPLES 


1. Determine the area contained between the a;- axis and the curve whose equation is 
y = sin 2x, from x — 0 to x — 

Solution 


r- 

1 4 . 

cos 2a;' 

/ sm 2x dx = 


Jo 

2 



2 



2. Determine the area contained between the rc-axis and the curve whose equation is 
y = 3x — x 2 , from x — 1 to x — 2. 

Solution 




13.1.2 DEFINITE INTEGRATION AS A SUMMATION 

Consider, now, the same area as in the previous section, but regarded (approximately) as 
the sum of a large number of narrow rectangles with typical width 5x and typical height 
f(x). The narrower the strips, the better will be the approximation. 



Hence, we may state an alternative expression for the area from x — a to x — b in the form 


3 



lim 'Y' f(x)8x. 

Sx^O y 


Since this new expression represents the same area as before, we may conclude that 


x=b 


lim 

5x -> o ^ 


f(x)Sx 



f(x)Sx. 


Notes: 

(i) The above result shows that an area which lies wholly below the x-axis will be negative 
and so care must be taken with curves which cross the x-axis between x = a and x = b. 

(ii) If c is any value of x between x = a and x = b, the above result shows that 

/ /(x) d x = f f(x ) d x + f f(x ) dx. 

J a J a J c 


(iii) To calculate the TOTAL area contained between the x-axis and a curve which crosses 
the x-axis between x = a and x = b, account must be taken of any parts of the area which 
are negative. 

(iv) It is usually a good idea to sketch the area under consideration before evaluating the 
appropriate definite integrals. 

(v) It will be seen shortly that the formula obtained for definite integration as a summation 
has a wider held of application than simply the calculation of areas. 

EXAMPLES 


1. Determine the total area between the x-axis and the curve whose equation is y = sin 2x, 
from x = ^ and x — 

Solution 


That is, 


cos2x 


7 T 
2 


2 


3tt 

8 



sin 2x dx — 



sin2x dx. 


cos 2x1 4 /l 1 \ 

”2“Jf = V~2V2) 



= 1 - 


1 

2 \/ 2 ' 


4 




2. Evaluate the definite integral, 



sin 2x da;. 


Solution 



sin 2a; da; 


cos 2a; 1 4 1 

2 “ ~2V^' 


13.1.3 EXERCISES 

1. Determine the areas bounded by the following curves and the a;-axis between the ordi- 
nates x — l and x = 3: 


(b) 

(c) 


y = 2ar + x + 1; 

y = (1 ~ x ) 2 -, 

y = 2y/x. 


2. Sketch the curve whose equation is 

y — (1 — x)(2 + x) 

and determine the area contained between the a;-axis and the portion of the curve above 
the a;-axis. 


5 



3. To the nearest whole number, determine the area bounded between x — 1 and x = 2 
by the curves whose equations are 


y = 3e 2x and y — 3e 


4. Determine the area bounded between x = 0 and x = | by 
are 

y = sin x and y = sin 2x. 

5. Determine the total area, from x — 0 to x — contained 
curve whose equation is 


the curves whose equations 


between the x-axis and the 


y = cos 5x. 


13.1.4 ANSWERS TO EXERCISES 


1. (a) 

70 _ 

y ; 

(b) 

8 _ 

3’ 


4V3-| 

2 . 

9 

2 ' 


5. 


0.25 


2^-1 

5^2 


~ 0.259 


6 
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UNIT 13.2 - INTEGRATION APPLICATIONS 2 


MEAN AND ROOT MEAN SQUARE VALUES 
13.2.1 MEAN VALUES 



On the curve whose equation is 


V = f{x), 


suppose that tj\ , y 2 , y-.u , y n are the ^-coordinates which correspond to n different 

^-coordinates, a = x±, x 2 , x 3 , , x n = b. 

The average (that is, the arithmetic mean) of these n ^-coordinates is 

V1+V2 + V3 + ±jjn 

n 

But now suppose that we wished to determine the average (arithmetic mean) of all the 
^-coordinates, from x — a to x — b on the curve whose equation is y — f(x). 

We could make a reasonable approximation by taking a very large number, n, of ^-coordinates 
separated in the x-direction by very small distances. If these distances are typically repre- 
sented by Sx, then the required mean value could be written 

V\bx + y 2 Sx + y 3 Sx + + y n 5x 

n5x 


1 



in which the denominator is equivalent to (6 — a + Sx), since there are only n — 1 spaces 
between the n ^-coordinates. 

Allowing the number of ^-coordinates to increase indefinitely, 5x will to tend to zero and we 
obtain the formula for the “Mean Value” in the form 


-i x=b 

M.V. = lim Y ySx. 

b- a 8x-+o 


That is, 


M.V. 



f(x) dx. 


Note: 

In cases where the definite integral in this formula represents the area between the curve 
and the a;- axis, the Mean Value provides the height of a rectangle, with base b — a, having 
the same area as that represented by the definite integral. 

EXAMPLE 

Determine the Mean Value of the function 


f(x) = x 2 — 5x 


from x = 1 to x = 4. 

Solution 

The Mean Value is given by 


M.V. = — - f (x 2 — 5x) da; = \ 
4 — 1 Ji 3 


x 

~3~ 


5a; 2 

~Y 


14 


J 1 


1 ■ 
3 . 





33 

Y' 


2 



13.2.2 ROOT MEAN SQUARE VALUES 

It is sometimes convenient to use an alternative kind of average for the values of a function, 
f(x), between x = a and x = b. 

The “Root Mean Square Value” provides a measure of “central tendency” for the nu- 
merical values of /(x) and is defined to be the square root of the Mean Value of /(x) from 
x = a to x = b. 

Hence, 


R.M.S.V. 



[ f(x )] 2 dx. 


EXAMPLE 

Determine the Root Mean Square Value of the function, /(x) = x 2 — 5, from x — 1 to x — 3. 

Solution 

The Root Mean Square Value is given by 


R.M.S.V. 



5) 2 dx. 


Temporarily ignoring the square root, we obtain the “Mean Square Value”, 


M.S.V. 



10x 2 + 25) dx 


1 Tx 5 

2 "5" 


10x 3 


- 3 

+ 25x 

. 1 




176 

~30~' 


Thus, 


R.M.S.V. 



2.422 


3 



13.2.3 EXERCISES 


1. (a) Determine the Mean Value of the function, {x — l)(a; — 2), from x = 1 to x = 2; 

(b) Determine, correct to three significant figures, the Mean Value of the function, 
2a ,^_ 5 , from x = 3 to x = 5; 

(c) Determine the Mean Value of the function, sin2t, from t — 0 to t — 

(d) Determine, correct to three places of decimals, the Mean Value of the function, 
e~ x , from x = 1 to x = 5; 

(e) Determine, correct to three significant figures, the mean value of the function, 
xe~ 2x , from x = 0 to x = 2. 

2. (a) Determine the Root Mean Square Value of the function, 3a; + 1, from x = —2 to 

x = 2; 

(b) Determine the Root Mean Square Value, of the function, e x , from x — 0 to x — 1, 
correct to three decimal places; 

(c) Determine the Root Mean Square Value of the function, cos a;, from x = | to x = 7r; 

3 

(d) Determine the Root Mean Square Value of the function, (4a; — 5 ) 2 , from x = 1.25 
to x — 1.5. 


13.2.4 ANSWERS TO EXERCISES 


1 . 


2 . 


(a) 

i. 

6’ 

(b) 

0.0775; 
2 . 

(c) 

7T 7 

(d) 

-0.076; 

(e) 

0.114 

(a) 

a/13 ~= 

(b) 

^ Y->- 

00 

-nI 

(c) 

/2- 

(d) 

1 

2’ 


3.606; 
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UNIT 13.3 - INTEGRATION APLICATIONS 3 


VOLUMES OF REVOLUTION 

13.3.1 VOLUMES OF REVOLUTION ABOUT THE X-AXIS 

Suppose that the area between a curve whose equation is 

V = f(x) 

and the x-axis, from x = a to x = b, lies wholly above the x-axis; suppose, also, that this 
area is rotated through 2n radians about the a;- axis. Then a solid figure is obtained whose 
volume may be determined as an application of definite integration. 



When a narrow strip of width, Sx, and height, y, is rotated through 2tt radians about the 
x-axis, we obtain a disc whose volume, SV, is given approximately by 

SV ~ 7T y 2 Sx. 

Thus, the total volume, V, obtained is given by 


V 


lim 

& E — >0 


x=b 

E 


7 ry 2 Sx. 


That is, 


V = 



ny 2 (lx. 


1 



EXAMPLE 


Determine the volume obtained when the area, bounded in the first quadrant by the x-axis, 
the y-axis, the straight line, x = 2, and the parabola, y 2 = 8x, is rotated through 2n radians 
about the x-axis. 

Solution 



r'2 

V = / 7T X 8x dx = [47TX 2 ]q = 167T. 

Jo 

13.3.2 VOLUMES OF REVOLUTION ABOUT THE Y-AXIS 

First we consider the same diagram as in the previous section: 



This time, if the narrow strip of width, 5x, is rotated through 27 t radians about the y-axis, 


2 



we obtain, approximately, a cylindrical shell of internal radius, x, external radius, x + Sx 
and height, y. 

The volume, 6V, of the shell is thus given by 


5V ~ 2nxy8x. 


The total volume is given by 


V 


lim 

< 5#— >0 


x=b 

y , 2tt xy5x. 

x=a 


That is, 


V 



2nxy (lx. 


EXAMPLE 

Determine the volume obtained when the area, bounded in the first quadrant by the x-axis, 
the y-axis, the straight line x = 2 and the parabola y 2 = 8x is rotated through 2n radians 
about the y-axis. 


3 



Solution 



V 



2nx x yj&x dx. 


In other words, 


V 


7t4v2 


3 

X 2 


dx = 7 t4\/2 


c: “I Z 

2x2 

~ 5 ~ 

J o 


647T 

~1T' 


Note: 

It may be required to find the volume of revolution about the y-axis of an area which is 
contained between a curve and the y-axis from y = c to y = d. 


4 





1. By using a straight line through the origin, obtain a formula for the volume, V, of a 
solid right-circular cone with height, h, and base radius, r. 

2. Determine the volume obtained when the segment straight line 

y = 5 — 4a;, 

lying between x = 0 and x — 1, is rotated through 2n radians about (a) the a;- axis and 
(b) the y- axis. 

3. Determine the volume obtained when the part of the curve 

y = cos 3a;, 

lying between x — | and x — |, is rotated through 27 t radians about the x-axis. 


5 



4. Determine the volume obtained when the part of the curve 

1 

V /n , ’ 

xy2 + x 

lying between x = 2 and x = 7, is rotated through 2tt radians about the x-axis. 

5. Determine the volume obtained when the part of the curve 

1 

^ (x — l)(x — 5) ’ 

lying between x = 6 and x = 8, is rotated through 2tt radians about the y-axis. 

6. Determine the volume obtained when the part of the curve 

x = ye~ y , 

lying between y — 0 and y — 1, is rotated through 2i r radians about the y-axis. 

7. Determine the volume obtained when the part of the curve 

y = sin 2x , 

lying between x — | and x — |, is rotated through 2n radians about the y-axis. 

8. Determine the volume obtained when the part of the curve 

y = x(l — x ) 3 , 

lying between x = 0 and x — 1, is rotated through 2tt radians about the x-axis. 

9. Determine the volume obtained when the part of the curve 

x=(4-y 2 ) 2 , 

lying between y = 1 and y = 2, is rotated through 27 t radians about the x-axis. 

10. Determine the volume obtained when the part of the curve 

y = xsec(x 3 ), 

lying between x = 0 and x = 0.5, is rotated through 27 t radians about the x-axis. 

11. Determine the volume obtained when the part of the curve 

1 

y = 2 — 7 ’ 
x z — 1 

lying between x = 2 and x = 3 is rotated through 2n radians about the y-axis. 


6 



13.3.4 ANSWERS TO EXERCISES 


1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 


V = 


1 

3 


7r r 2 h. 


7 r (TT 

(a) - ~ 1.047 (b) — ~ 7.330 

o o 


12 


~ 0.822 


0.214 approximately. 
8.010 approximately. 
0.254 approximately. 
3.364 approximately. 

— ~ 0.698 
9 

9tt ~ 28.274 
0.132 approximately. 
3.081 approximately. 
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UNIT 13.4 - INTEGRATION APPLICATIONS 4 - LENGTHS OF CURVES 


13.4.1 THE STANDARD FORMULAE 

The problem, in this unit, is to calculate the length of the arc of the curve with equation 


V = f{x), 


joining the two points, P and Q, on the curve, at which x = a and x = b. 



For two neighbouring points along the arc, the part of the curve joining them may be 
considered, approximately, as a straight line segment. 

Hence, if these neighbouring points are separated by distances of 5x and 5y, parallel to 
the x-axis and the y-axis respectively, then the length, 5s, of arc between them is given, 
approximately, by 


5s ~ \j (5x) 2 + ( 5y ) 2 




2 

5x, 


using Pythagoras’s Theorem. 

The total length, s, of arc is thus given by 


s = 




2 

5x. 


1 



That is, 


s = 




dx. 


Notes: 

(i) If the curve is given parametrically by 


then 


Hence, 


x = x(t), y = y(t), 


/-] rp dj) 

UX dt 




provided (] , r t is positive on the arc being considered. If not, then the above line needs to be 
prefixed by a negative sign. 

From the technique of integration by substitution, 





2 


da; 

dt 


dt, 


where t = t\ when x = a and t = t 2 when x = b. 
We may conclude that 





dt, 


s — ± 


2 




acccording as ^ is positive or negative, 
(ii) For an arc whose equation is 


x = g(y), 

contained between y = c and y = d, we may reverse the roles of x and y, so that the length 
of the arc is given by 


s = 




d y. 


EXAMPLES 

1. A curve has equation 


9 y 2 = 16a: 3 . 


Determine the length of the arc of the curve between the point (l, and the point 

( 4 -fl 

Solution 

We may write the equation of the curve in the form 


and so, 


Hence, 


y = 


Ax 2 

IT’ 


d y 

da; 


1 

= 2a; 2 . 



\/l + Ax da; 


(1 + 4a;) 2 
6 


17f 


6 


5 2 

— ~ 13.55 
6 


3 



2. A curve is given parametrically by 


x = t 2 — 1, y — t 3 + 1. 


Determine the length of the arc of the curve between the point where t — 0 and the 
point where t — 1. 

Solution 

Since 


da; 

d t 


2 1 and 


dy 

dt 


3 1 2 


we have 


s = 



V 4f 2 + 6 f 4 dt 



tV4 + 6t 2 dt 



~ 1.31 


13.4.2 EXERCISES 


1. A straight line has equation 


y = 3x + 2. 

Use (a) elementary trigonometry and (b) definite integration to determine the length 
of the line segment joining the point where x = 3 and the point where x = 7. 

2. A curve has equation 


± 2 ^ i 

y = -x — - mx. 
y 2 4 

Determine the length of the arc of the curve between x — 1 and x = e. 
3. A curve has equation 


x = 2 (y + 3)2. 

Determine the length of the arc of the curve between y = —2 and y — 1, stating your 
answer in decimals correct to four significant figures. 


4 



4. A curve is given parametrically by 


x — t — sin t, y = 1 — cos t. 

Determine the length of the arc of the curve between the point where t 
point where t — 2n. 

5. A curve is given parametrically by 

x = 4(cos# + # sin#), y = 4(sin# — 6 cos#). 

Determine the length of the arc of the curve between the point where # 
point where 9 — |. 

6. A curve is given parametrically by 

x = e u sin u, y = e u cos u. 

Determine the length of the arc of the curve between the point where u 
point where u — 1. 

13.4.3 ANSWERS TO EXERCISES 

1. 

4VTo ~ 12.65 

2 . 



3. 


4. 


14.33 


5. 


71 


2 


8 ■ 


6 . 


V2(e - 1) ~ 2.43 


0 and the 


0 and the 


0 and the 


5 
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UNIT 13.5 - INTEGRATION APPLICATIONS 5 


SURFACES OF REVOLUTION 

13.5.1 SURFACES OF REVOLUTION ABOUT THE X-AXIS 

The problem, in this unit, is to calculate the surface area obtained when the arc of the curve, 
with equation 


V = f{x), 

joining the two points, P and Q, on the curve, at which x = a and x = b, is rotated through 
2n radians about the x-axis or the y- axis. 



For two neighbouring points along the arc, the part of the curve joining them may be 
considered, approximately, as a straight line segment. 

Hence, if these neighbouring points are separated by distances of 5x and by, parallel to 
the x-axis and the y-axis, respectively, then the length, 5s, of arc between them is given, 
approximately, by 


5s ~ \J ( 5x ) 2 + ( 5y ) 2 




2 

5x, 


using Pythagoras’s Theorem. 

When the arc, of length 5s, is rotated through 2n radians about the x-axis, it generates a 
thin band whose area is, approximately, 


1 



2 


2ny6s 



1 + 



5x. 


The total surface area, S, is thus given by 


That is, 


S = lim 

& c — >0 


2n v 


\ 


i + 



2 

5x. 



2ny 


\ 


i + 



da;. 


Note: 

If the curve is given parametrically by 


then, 


Hence, 


x = x{t), y — y(t), 


d» = l 

Arr dx ' 

UX At 




provided is positive on the arc being considered. If not, then the above line needs to be 
prefixed by a negative sign. 


2 




From the technique of integration by substitution, 



where t = t\ when x = a and t — when x = b. 
We may conclude that 



acccorcling as is positive or negative. 

EXAMPLES 


1. A curve has equation 

y 2 = 2x. 

Determine the surface area obtained when the arc of the curve between the point (2, 2)) 
and the point (8,4) is rotated through 27 t radians about the x-axis. 

Solution 

We may write the equation of the arc of the curve in the form 

y = \/2x = \^2x^] 

and so, 

dy 1 r~ _1 1 

dx 2 ^2x 

Hence, 



3 



2. A curve is given parametrically by 


x = \/2cos 9, y = V^sin 6. 

Determine the surface area obtained when the arc of the curve between the point (0, y/2) 
and the point (1, 1) is rotated through 2n radians about the x-axis. 

Solution 

The parameters of the two points are | and |, respectively; and, since 

— = — \/2sin 6 and = \/2cos 6, 
at) at) 

we have 

7T 7T 

S = — / 4 2V2-K sin 6\J 2sin 2 d + 2cos 2 6* dO = — f ^ 47rsin 9 d 9. 

•'f ■'f 

Thus, 


S 


[— 47T cos d] | 


~ 8.89 

V2 


13.5.2 SURFACES OF REVOLUTION ABOUT THE Y-AXIS 

For a curve whose equation is of the form x = g(y ), the surface of revolution about the y-axis 
of an arc joining the two points at which y = c and y — d is given by 


S — 2irx 


\ 


1 + ur dy - 

\dy 


We simply reverse the roles of x and y in the previous section. 


4 




Alternatively, if the curve is given parametrically, 



acccording as is positive or negative. 

EXAMPLE 

If the arc of the parabola, with equation 


x 2 = 2 y, 

joining the two points (2,2) and (4,8), is rotated through 27r radians about the y-axis, 
determine the surface area obtained. 

Solution 

Using the result from the previous section, the surface area obtained is given by 




19.64 


5 



13.5.3 EXERCISES 


1. Use a straight line through the origin to determine the surface area of a right-circular 
cone with height, h, and base radius, r. 

2. Determine the surface area obtained when the arc of the curve x = y 3 , between y = 0 
and y — 1, is rotated through 27 t radians about the y- axis. 

3. A curve is given parametrically by 

x = t — sin t , y = 1 — cos t. 

Determine the surface area obtained when the arc of the curve between the point where 
t — 0 and the point where t — | is rotated through 27r radians about the x-axis. 

State your answer correct to three places of decimals. 

4. A curve is given parametrically by 

x = 4(cos6 l + 9 sin#), y = 4(sin 9 — 9 cos 9). 

Determine the surface area obtained when the arc of the curve between the point where 
9 = 0 and the point where 9 = | is rotated through 2n radians about the x-axis. 

5. A curve is given parametrically by 

x = e u cos u, y = e u sin u. 

Determine the surface area obtained when the arc of the curve between the point where 
u = 0 and the point where u = | is rotated through 27 t radians about the y-axis. 

State your answer correct to three places of decimals. 


6 



13.5.4 ANSWERS TO EXERCISES 


1 . 


2 . 


3. 

4. 


5. 


7r rVr 2 + h 2 . 

7r(10«/L0 - 1) ^ 
27 “ 

3.891 



1.037 


3.56 


53.54 
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UNIT 13.6 - INTEGRATION APPLICATIONS 6 


FIRST MOMENTS OF AN ARC 
13.6.1 INTRODUCTION 

Suppose that C denotes an arc (with length s) in the xy-plane of cartesian co-ordinates; and 
suppose that 5s is the length of a small element of this arc. 

Then the “first moment” of C about a fixed line, /, in the plane of C is given by 

lim 'Y' h5s , 

<5s^0 ’ 

where h is the perpendicular distance, from /, of the element with length 5s. 



13.6.2 FIRST MOMENT OF AN ARC ABOUT THE Y-AXIS 

Let us consider an arc of the curve, whose equation is 


V = f 0), 


joining two points, P and Q, at x = a and x = b, respectively. 


1 



a 


5x b 


The arc may divided up into small elements of typical length, 5s, by using neighbouring 
points along the arc, separated by typical distances of 5x (parallel to the x-axis) and 5y 
(parallel to the y-axis) . 

The first moment of each element about the y-axis is x times the length of the element; that 
is x5s, implying that the total first moment of the arc about the y-axis is given by 

lim V x5s. 

Ss^O V 


But, from Pythagoras’ Theorem, 


5s ~ \j (5x) 2 + ( 5y ) 2 




2 

5x, 


so that the first moment of the arc becomes 


lim 

>0 


E 


x=a 


X 


\ 


i+ 




X 


\ 


1 + 



dx. 


Note: 

If the curve is given parametrically by 


x = x(t), y = y(t ), 


2 



then, using the same principles as in Unit 13.4, we may conclude that the first moment of 
the arc about the y-axis is given by 


± 





dt, 


according as is positive or negative. 

13.6.3 FIRST MOMENT OF AN ARC ABOUT THE X-AXIS 

(a) For an arc whose equation is 


V = f{x), 

contained between x = a and x — b, the first moment about the x-axis will be 



'\ 


1 + 



dx. 


Note: 

If the curve is given parametrically by 


x = x{t), y = y(t ), 

then, using the same principles as in Unit 13.4, the first moment of the arc about the avaxis 
is given by 


± 





dt, 


according as % is positive or negative. 


3 



(b) For an arc whose equation is 


x = g{y), 

contained between y = c and y = d, we may reverse the roles of x and y in section 13.6.2 so 
that the first moment of the arc about the x-axis is given by 



Note: 

If the curve is given parametrically by 

x = x(t), y = y(t), 

then, using the same principles as in Unit 13.4, we may conclude that the first moment of 
the arc about the x-axis is given by 



according as 4^ is positive or negative and where t — t,\ when y = c and t = t 2 when y — d. 


4 



EXAMPLES 


1. Determine the first moments about the x-axis and the y-axis of the arc of the circle, 
with equation 

2,2 2 

x + y — a - 

lying in the first quadrant. 

Solution 

y 


X 



Using implicit differentiation, we have 


2x + 2 y 


dy 

dx 


0, 


and hence, ^ 

7 ax y 

The hrst moment of the arc about the y-axis is therefore given by 



x\ 1 + 


X * 




But x 2 + y 2 = a 2 and y = \J a 2 — x 2 . 
Hence, 


first moment 



ax 

\/a 2 — x 2 


dx 



= o 


By symmetry, the first moment of the arc about the x-axis will also be a 2 . 
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2. Determine the first moments about the x-axis and the y-axis of the hrst quadrant arc 
of the curve with parametric equations 

x = acos 3 d, y = asin 3 #. 

Solution 



Firstly, we have 

— = — 3acos 2 0sind and = 3asin 2 d cos 9. 
d9 dd 

Hence, the hrst moment about the x-axis is given by 

rO 


y\j 9 a 2 cos 4 9 sin 2 9 + 9a 2 sin 4 dcos 2 d d.6, 
which, on using cos 2 9 + sin 2 # = 1, becomes 

7T 

2 asm 3 9. 3a cos 9 sin 9 d9 
Jo 

= 3a 2 / " sin 4 # cos 9 d 9 


= 3 a 2 


'sin 5 9' 


3 a 2 

X' 
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Similarly, the first moment of the arc about the y-axis is given by 


x 


\ 


— ^ f d 9— f 2 acos 3 #.(3a cos # sin #) d6 
cl 0 I \ cl 0 J o 


7T 

cos 5 # 

/ cos 4 d sin 6 d 9 = 3a" 


Jo 

5 


J o 


2 _ 3a 2 


though, again, this second result could be deduced, by symmetry, from the first. 


13.6.4 THE CENTROID OF AN ARC 

Having calculated the first moments of an arc about both the x-axis and the y-axis it is 
possible to determine a point, (x, y), in the xy-plane with the property that 

(a) The first moment about the y-axis is given by sx, where s is the total length of the arc; 
and 

(b) The first moment about the x-axis is given by sy, where s is the total length of the arc. 

The point is called the “centroid” or the “geometric centre” of the arc and, for an arc 
of the curve with equation y = f(x), between x = a and x = b, its co-ordinates are given by 


rb 

fa X \ 

1 1 + 

( did 

2 

) dx 

rb 

Ja \j 

/l+( 

ydxj 

~2 

dx 


and y 


rb 

Ja V\ 

h 

(dy' 

\ da: 

\ 2 

j dx 

si v 

'i+( 

dy\ 

da- / 

2 

dx 


Notes: 

(i) The first moment of an arc about an axis through its centroid will, by definition, be zero. 
In particular, if we take the y-axis to be parallel to the given axis, with x as the perpendicular 
distance from an element, 6s, to the y-axis, the first moment about the given axis will be 

'^2 { x ~ x)5s = ^2 x $ s — x '^2 Ss = sx — sx = 0. 
c c c 

(ii) The centroid effectively tries to concentrate the whole arc at a single point for the 
purposes of considering first moments. In practice, it corresponds, for example, to the 
position of the centre of mass of a thin wire with uniform density. 
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EXAMPLES 


1. Determine the cartesian co-ordinates of the centroid of the arc of the circle, with equa- 
tion 

2,2 2 
x y — (7. . 


lying in the first quadrant. 

Solution 



From an earlier example in this unit, we know that the first moments of the arc about 
the x-axis and the y - axis are both equal to a 2 . 

Also, the length of the arc is which implies that 

2 a _ 2 a 

x = — and y = — . 

7T 7T 

2. Determine the cartesian co-ordinates of the centroid of the first quadrant arc of the 
curve with parametric equations 


x = acos 3 d, y = asin 3 #. 



Solution 



From an earlier example in this unit, we know that 


— = — 3acos 2 d sin 9 and = 3asin 2 0 cos 9 
dd d 9 

and that the first moments of the arc about the x-axis and the y-axis are both equal to 

3a 2 
5 * 

Also, the length of the arc is given by 


dO — [ 2 \/9a 2 cos 4 6 l sin 2 d + 9a 2 sin 4 dcos 2 d dd. 
Jo 


This simplifies to 


3a / 2 cos 9 sin 9 d 9 = 3 a 
Jo 


Thus, 


2 a _ 2 a 

x = — and y — 

5 5 
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13.6.5 EXERCISES 


1. Determine the first moment about the y-axis of the arc of the curve with equation 

y = x 2 , 

lying between x = 0 and x — 1. 

2. Determine the first moment about the x-axis of the arc of the curve with equation 

x = by 2 , 

lying between y — 0.1 and y = 0.5. 

3. Determine the first moment about the x-axis of the arc of the curve with equation 

V = 2 y/x, 

lying between x = 3 and x = 24. 

4. Verify, using integration, that the centroid of the straight line segment, defined by the 
equation 

y = 3x + 2, 

from x = 0 to x = 1, lies at its centre point. 

5. Determine the cartesian co-ordinates of the centroid of the arc of the circle given para- 
metrically by 



show that 

d y 1 — x 

dx 2y / X 

Hence show that the centroid of the first quadrant arch of this curve lies at the point 
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13.6.6 ANSWERS TO EXERCISES 


1 . 

2 . 

3. 

4. 

5. 


5V5 — 1 


12 

13^26- y/2 
150 


0.85 


~ 0.43 


156. 


, ' and y=l 

2 y 2 


x — — ~ 4.77, y = 0. 

71 
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UNIT 13.7 - INTEGRATION APPLICATIONS 7 


FIRST MOMENTS OF AN AREA 
13.7.1 INTRODUCTION 

Suppose that R denotes a region (with area A) of the xy-plane of cartesian co-ordinates, and 
suppose that 6 A is the area of a small element of this region. 

Then the “first moment” of R about a fixed line, /, in the plane of R is given by 



where h is the perpendicular distance, from /, of the element with area, 5A. 



13.7.2 FIRST MOMENT OF AN AREA ABOUT THE Y-AXIS 

Let us consider a region in the first quadrant of the a;y-plane, bounded by the a;- axis, the 
lines x — a, x — b and the curve whose equation is 


V = f(x). 


1 



a 


5x b 


The region may divided up into small elements by using a network, consisting of neighbouring 
lines parallel to the y - axis and neighbouring lines parallel to the rr-axis. 

But all of the elements in a narrow ‘strip’ of width Sx and height y (parallel to the y- axis) 
have the same perpendicular distance, x , from the y- axis. 

Hence the first moment of this strip about the y - axis is x times the area of the strip; that 
is, x(ySx), implying that the total first moment of the region about the y-axis is given by 


x=b 

lim V xy5x 

6x^0 _ 



xy dx. 


Note: 

First moments about the x-axis will be discussed mainly in the next section of this Unit; 
but we note that, for a region of the first quadrant, bounded by the y- axis, the lines y = c, 
y — d and the curve whose equation is 


x = g(y), 


we may reverse the roles of x and y so that the first moment about the rc-axis is given by 



yx d y. 
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EXAMPLES 

1. Determine the first moment of a rectangular region, with sides of lengths a and b about 
the side of length b. 

Solution 



3 



2. Determine the first moment about the y-axis of the semi-circular region, bounded in 
the first and fourth quadrants by the y- axis and the circle whose equation is 

2,2 2 

x + y — a . 

Solution 


y 



Since there will be equal contributions from the upper and lower halves of the region, 
the first moment about the y-axis is given by 



Note: 

Although first moments about the re-axis will be discussed mainly in the next section 
of this Unit, we note that the symmetry of the above region shows that its first mo- 
ment about the re-axis would be zero; this is because, for each y(xSy), there will be 
a corresponding — y(rc5y) in calculating the first moments of the strips parallel to the 
re-axis. 

13.7.3 FIRST MOMENT OF AN AREA ABOUT THE X-AXIS 

In the first example of the previous section, a formula was established for the first moment 
of a rectangular region about one of its sides. This result may now be used to determine the 
first moment about the re-axis of a region enclosed in the first quadrant by the re-axis, the 
lines x — a, x — b and the curve whose equation is 

V = f(x). 


4 



a 


5x b 


If a narrow strip, of width 5x and height y, is regarded as approximately a rectangle, its first 
moment about the x-axis is | y 2 5x. Hence, the first moment of the whole region about the 
x-axis is given by 


lim 

>0 


x ~b 1 pb I 

53 o y25x = / o y2 dx - 

x=a " J a £ 


EXAMPLES 


1. Determine the first moment about the x-axis of the region, bounded in the first quad- 
rant, by the x-axis, the y-axis, the line x = 1 and the curve whose equation is 

y = x 2 + 1. 
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Solution 



2. Determine the first moment about the x-axis of the region, bounded in the first quad- 
rant, by the x-axis, the lines x — 1, x — 2 and the curve 

y = xe x . 

Solution 



6 



That is, 



13.7.4 THE CENTROID OF AN AREA 

Having calculated the first moments of a two dimensional region about both the x-axis and 
the y-axis, it is possible to determine a point, (x, y), in the xy-plane with the property that 

(a) The first moment about the y-axis is given by Ax, where A is the total area of the region; 
and 

(b) The first moment about the x-axis is given by Ay, where A is the toal area of the region. 

The point is called the “centroid” or the “geometric centre” of the region and, in the 
case of a region bounded, in the first quadrant, by the x-axis, the lines x = a, x = b and the 
curve y — f(x), its co-ordinates are given by 


_ _ la xy dx 
la ydx 


and y 


fa M dx ^ 

fa y dx 


Notes: 

(i) The first moment of an area, about an axis through its centroid will, by definition, be 
zero. In particular, if we take the y-axis to be parallel to the given axis, with x as the 
perpendicular distance from an element, 5 A, to the y-axis, the first moment about the given 
axis will be 


22 (x — x)5A = '22 X &A — x 22 

R R R 


6 A = Ax — Ax 


0 . 
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(ii) The centroid effectively tries to concentrate the whole area at a single point for the 
purposes of considering first moments. In practice, it corresponds to the position of the 
centre of mass for a thin plate with uniform density, whose shape is that of the region which 
we have been considering. 

EXAMPLES 

1. Determine the position of the centroid of a rectangular region with sides of lengths, a 
and b. 

Solution 



The area of the rectangle is ab and, from Example 1 in section 13.7.2, the first moments 
about the y - axis and the avaxis are \a 2 b and \ b 2 a , respectively. 

Hence, 

\a 2 b 1 
x = , = -a 

ab 2 

and 


V = 


\b 2 a, 

ab 



as we would expect for a rectangle. 



2. Determine the position of the centroid of the semi-circular region bounded, in the first 
and fourth quadrants, by the y-axis and the circle whose equation is 

2,2 2 
x + y —a . 


Solution 



The area of the semi-circular region is |vra 2 and so, from Example 2, in section 13.7.2, 


x = 


| a 3 
\tx a 2 


4 a 
3 tt 


and y 


0 . 


3. Determine the position of the centroid of the region, bounded in the first quadrant, by 
the x-axis, the y-axis, the line x — 1 and the curve whose equation is 

y = x 2 + 1. 
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Solution 



The first moment about the y-axis is given by 


The area is given by 


Hence, 



x(x 2 + 1) dx 




3 

T 



(x 2 + 1) dx 



+ x 


4 

3' 



4 

3 


1 . 


The first moment about the x-axis is from Example 1 in section 13.7.3; and, therefore, 


V = 



7 

5 


4. Determine 
the x-axis, 


the position of the centroid of the region bounded in the first quadrant by 
the lines x — 1, x — 2 and the curve whose equation is 


y = xe x . 
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Solution 



The first moment about the y-axis is given by 


J x 2 e x da: = 

using integration by parts (twice). 
The area is given by 

r2 


x 2 e x - 2xe x + 2e x 


1 2 


J 1 


~ 12.06, 


xe x dx = \xe x — e x ] 2 ~ 7.39 


using integration by parts (once). 
Hence, 


x ~ 12.06 -j- 7.39 ~ 1.63 

The first moment about the oxaxis is approximately 33.20, from Example 2 in section 
13.7.3; and so, 


y ~ 33.20 ^ 7.39 ~ 4.47 


13.7.5 EXERCISES 

Determine the position of the centroid of each of the following regions of the xy-plane: 

1. Bounded in the first quadrant by the a;-axis, the y-axis and the curve whose equation is 

2,2 2 
x + y = a . 
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2. Bounded by the line x — 1 and the semi-circle whose equation is 

(x — l) 2 + y 2 = 4, x > 1. 

3. Bounded in the fourth quadrant by the x-axis, the y-axis and the curve whose equation 

is 

y = 2x 2 — 1. 

4. Bounded in the first quadrant by the x-axis and the curve whose equation is 

y = sinx. 

5. Bounded in the hrst quadrant by the x-axis, the y-axis, the line x = 1 and the curve 
whose equation is 


— 2r 

y = xe 


13.7.6 ANSWERS TO EXERCISES 


1 . 


2 . 


3. 


4. 


5. 


/ 4a 4a \ 
V3vr’ 37r/ ' 



3\/2 13 \ 

T6~ ,_ 20; ' 


7 r 7T 
2 ’ 8 


(0.28,0.04). 
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UNIT 13.8 - INTEGRATION APPLICATIONS 8 


FIRST MOMENTS OF A VOLUME 
13.8.1 INTRODUCTION 

Suppose that R denotes a region of space (with volume V) and suppose that 5V is the volume 
of a small element of this region. 

Then the “first moment” of R about a fixed plane, p, is given by 


lim Y' h5V, 
5V^ 0 V 

rv. 


where h is the perpendicular distance, from p, of the element with volume, 8V . 



13.8.2 FIRST MOMENT OF A VOLUME OF REVOLUTION ABOUT A PLANE 
THROUGH THE ORIGIN, PERPENDICULAR TO THE X-AXIS. 

Let us consider the volume of revolution about the a;-axis of a region, in the first quadrant 
of the rr^-plane, bounded by the x-axis, the lines x — a, x — b and the curve whose equation 

is 


V = f(x). 


1 



a 


5x b 


For a narrow ‘strip’ of width, 5x, and height, y, parallel to the y- axis, the volume of revolution 
will be a thin disc with volume Tiy 2 5x and all the elements of volume within it have the same 
perpendicular distance, x, from the plane about which moments are being taken. 

Hence the first moment of this disc about the given plane is x times the volume of the disc; 
that is, x(ny 2 5x), implying that the total first moment is given by 


lim 

& E — >0 


x=b 

Y, TTxy 2 5x 

x=a 



7T xy 2 da;. 


Note: 

For the volume of revolution about the y-axis of a region in the first quadrant, bounded by 
the y-axis, the lines y = c, y = d and the curve whose equation is 


x = g(y), 


we may reverse the roles of x and y so that the first moment of the volume about a plane 
through the origin, perpendicular to the y-axis, is given by 



7T yx 2 dy. 
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EXAMPLES 

1. Determine the first moment of a solid right-circular cylinder with height, a and radius 
b, about one end. 

Solution 


y 



Let us consider the volume of revolution about the x-axis of the region, bounded in the 
first quadrant of the xy- plane, by the x-axis, the y-axis and the lines x = a, y = b. 

The first moment of the volume about a plane through the origin, perpendicular to the 
x-axis, is given by 


3 



[ a „ 

7 rx 2 6 2 

/ 7 rxcr dx = 


Jo 

2 


7r a 2 b 2 
2 


2. Determine the first moment of volume, about its plane base, of a solid hemisphere with 
radius a. 

Solution 

Let us consider the volume of revolution about the x-axis of the region, bounded in the 
first quadrant, by the x-axis, y-axis and the circle whose equation is 


2,2 2 
x + y = a . 



The hrst moment of the volume about a plane through the origin, perpendicular to the 
x-axis is given by 

7 rx(a 2 — x 2 ) dx = 

Note: 

The symmetry of the solid figures in the above two examples shows that their hrst 
moments about a plane through the origin, perpendicular to the y-axis would be zero. 
This is because, for each y5V in the calculation of the total hrst moment, there will be 
a corresponding —y8V. 

In much the same way, the hrst moments of volume about the xy-plane (or indeed any 
plane of symmetry) would also be zero. 
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13.8.3 THE CENTROID OF A VOLUME 


Suppose R denotes a volume of revolution about the x-axis of a region of the xy- plane, 
bounded by the x-axis, the lines x — a, x — b and the curve whose equation is 


V = f{x). 


Having calculated the first moment of R about a plane through the origin, perpendicular 
to the x-axis (assuming that this is not a plane of symmetry), it is possible to determine a 
point, (x, 0), on the x-axis with the property that the first moment is given by Vx, where V 
is the total volume of revolution about the x-axis. 

The point is called the “centroid” or the “geometric centre” of the volume, and x is 
given by 


la 7 rxy 2 dx = J b xy 2 dx 
fa 7ry 2 dx f b a y 2 dx ' 


Notes: 

(i) The centroid effectively tries to concentrate the whole volume at a single point for the 
purposes of considering first moments. It will always lie on the line of intersection of any 
two planes of symmetry. 

(ii) In practice, the centroid corresponds to the position of the centre of mass for a solid 
with uniform density, whose shape is that of the volume of revolution which we have been 
considering. 

(iii) For a volume of revolution about the y-axis, from y = c to y = d, the centroid will lie 
on the y-axis, and its distance, y, from the origin will be given by 


_ = Ic ^yx 2 dy = / c d yx 2 dy 
Sc vrx 2 dy / c d x 2 dy 

(iv) The first moment of a volume about a plane through its centroid will, by definition, be 
zero. In particular, if we take the plane through the y-axis, perpendicular to the x-axis to 
be parallel to the plane through the centroid, with x as the perpendicular distance from an 
element, 5V, to the plane through the y-axis, the first moment about the plane through the 
centroid will be 
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EXAMPLES 


53 (X - x)5V = 53 x5V — x 53 5V = Vx-Vx = 0. 

R R R 


1. Determine the position of the centroid of a solid right-circular cylinder with height, a, 
and radius, b. 

Solution 


y 


b 


6 

a 


x 


Using Example 1 in Section 13.8.2, the centroid will lie on the x-axis and the first 

2 l.2 

moment about a plane through the origin, perpendicular to the x-axis is ' Ka 2 t> . 

Also, the volume is Tib 2 a. 


Hence, 


na 2 b 2 
T = 2 

nb 2 a 


a 

2’ 


as we would expect for a cylinder. 
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2. Determine the position of the centroid of a solid hemisphere with base-radius, a. 

Solution 



Let us consider the volume of revolution about the x-axis of the region bounded in the 
first quadrant by the x-axis, the y- axis and the circle whose equation is 

2,2 2 

x + y = a 

From Example 2 in Section 13.8.2, the centroid will lie on the x-axis and the first 
moment of volume about a plane through the origin, perpendicular to the x-axis is 

Also, the volume of the hemisphere is |7ra 3 and so, 

§7ra 3 3a 

/-y> d 


3. Determine the position of the centroid of the volume of revolution about the y-axis 
of region bounded in the first quadrant by the x-axis, the y-axis and the curve whose 
equation is 

y = 1 — x 2 . 
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Solution 



Firstly, by symmetry, the centroid will lie on the y- axis. 

Secondly, the first moment about a plane through the origin, perpendicular to the y - axis 
is given by 


[ ny{ 1 - y) dy = vr 
Jo 


y_ 

2 


r 

3 


n i 


J o 


7T 

6 ' 


Thirdly, the volume is given by 



*■(1 - V ) dy 


r 

2 


7T 

2 ' 


Hence, 


7T _ 7T 1 

6 “ 2 “ 3' 


4. Determine the position of the centroid of the volume of revolution about the x-axis of 
the region, bounded in the hrst quadrant by the x-axis, the lines x = 1, x = 2 and the 
curve whose equation is 


V = e x . 



Solution 



Firstly, by symmetry, the centroid will lie on the x axis. 

Secondly, the First Moment about a plane through the origin, perpendicular to the 
x-axis is given by 



nxe 2x dx = 7 r 




122.84, 


using integration by parts. 
The volume is given by 


p 2 

"2a:" 

/ ne 2x dx = 7 r 

o 

Ji 

2 


~ 74.15 


Hence, 


x ~ 122.84 4- 74.15 ~ 1.66 


13.8.4 EXERCISES 

1. Determine the position of the centroid of the volume obtained when each of the following 
regions of the xy - plane is rotated through 2ti radians about the x-axis: 

(a) Bounded in the first quadrant by the x-axis, the line x = 1 and the quarter-circle 
represented by 


(x — l) 2 + y 2 = 4, x > 1, y > 0; 
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(b) Bounded in the first quadrant by the x-axis, the y-axis and the curve whose equation 

is 

V = 1 - 2a; 2 ; 

(c) Bounded in the first quadrant by the x-axis, the y-axis, the line x — | and the 
curve whose equation is 

y = sin a;; 

(d) Bounded in the first quadrant by the a;-axis, the y-axis, the line x = 1 and the 
curve whose equation is 


y = \fxe x . 

2. A solid right-circular cone, whose vertex is at the origin, has, for its central axis, the 
part of the y-axis between y = 0 and y = h. Determine the position of the centroid of 
the cone. 


13.8.5 ANSWERS TO EXERCISES 


L (a) 

(b) 

(c) 

(d) 


2 . 


x = 1.75; 
x ~ 0.22; 
x ~ 1.10; 


x ~ 0.36 


V = 


3 h 

T' 
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UNIT 13.9 - INTEGRATION APPLICATIONS 9 


FIRST MOMENTS OF A SURFACE OF REVOLUTION 
13.9.1 INTRODUCTION 

Suppose that C denotes an arc (with length s) in the xy-plane of cartesian co-ordinates, and 
suppose that 5s is the length of a small element of this arc. 

Then, for the surface obtained when the arc is rotated through 2i r radians about the x-axis, 
the “first moment” about a plane through the origin, perpendicular to the x-axis, is given 
by 


lim 'S'' 2nxy5s, 

Ss^o c 

where x is the perpendicular distance, from the plane of moments, of the thin band, with 
surface area 2ny5s, so generated. 

13.9.2 INTEGRATION FORMULAE FOR FIRST MOMENTS 

(a) Let us consider an arc of the curve whose equation is 


V = }{x), 

joining two points, P and Q, at x = a and x = 6, respectively. 



The arc may be divided up into small elements of typical length, 5s , by using neighbouring 


1 



points along the arc, separated by typical distances of Sx (parallel to the x-axis) and Sy 
(parallel to the y-axis) . 

From Pythagoras’ Theorem, 



so that, for the surface of revolution of the arc about the x-axis, the hrst moment becomes 

lim 2nxy\ 1 + ( Sx — [ 2tt xy x 1 + ( dx. 

5x^0 ^ \Sx) Ja Aj \dxj 

Note: 

If the curve is given parametrically by 

x = x(t), y = y(t), 

then 

d|/ = i 

A , r dx • 

UX dt 

Hence, 


dt 

provided that is positive on the arc being considered. If not, then the above line needs 
to be prefixed by a negative sign. 




2 




From the technique of integration by substitution, 


2tt xy 


\ 


1 + 



rt2 

dx= 2nxy 
Jt i 


\ 


i + 



da; 
’ dt 


dt, 


where t = t\ when x = a and t = t 2 when x = b. 

We may conclude that the first moment about the plane through the origin, perpendicular 
to the a;-axis is given by 


First Moment 





dt, 


according as is positive or negative, 
(b) For an arc whose equation is 


x = g(y), 

contained between y = c and y = d, we may reverse the roles of x and y in the previous 
section so that the first moment about a plane through the origin, perpendicular to the 
y-axis is given by 
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Note: 

If the curve is given parametrically by 


x = x(t), y = y(t), 


where t = t\ when y = c and t = t -2 when y — d, then the first moment about a plane 
through the origin, perpendicular to the y- axis, is given by 


First moment 



df, 


according as is positive or negative. 

EXAMPLES 


1. Determine the first moment about a plane through the origin, perpendicular to the 
x-axis, for the hemispherical surface of revolution (about the x-axis) of the arc of the 
circle whose equation is 


2,2 2 
x + y = a , 


lying in the first quadrant. 


4 



Solution 



Using implicit differentiation, we have 


2x + 2 y 


d y 

da; 


0 


and hence, ^ 

’ ax y 

The first moment about the specified plane is therefore given by 



But x 2 + y 2 = a 2 , and so the first moment becomes 



2ttcix dx = [7raa; 2 ]o = n a 3 . 


2. Determine the first moments about planes through the origin, (a) perpendicular to the 
a;-axis and (b) perpendicular to the y-axis, of the first quadrant arc of the curve with 
parametric equations 


x = acos 3 0, y = asin 3 #. 


5 



Solution 



Firstly, we have 

= — 3acos 2 0sin0 and — ^ = 3asin 2 0 cos 0. 
d0 d0 

Hence, the first moment about the x-axis is given by 

— j 27rxy-\/9a 2 cos 4 0sin 2 0 + 9a 2 sin 4 0cos 2 0 d 9, 
which, on using cos 2 0 + sin 2 0 = 1, becomes 

7 r 7 r 

/ 27ra 2 cos 3 0sin 3 0.3acos0sin0 c!0 = /" 67ra 3 cos 4 0sin 4 0 c!0. 


Using 2 sin 9 cos 9 = sin 20, the integral reduces to 

3yra 3 . 4 
/ sm 20 c!0, 


which, by the methods of Unit 12.7, gives 


37ra 3 f 

32 Jo 


f h 0 l + cosi 

I 1 — 2 cos 40 H — 


d0 = 


37 ra 3 
32 


30 sin 40 | sin8i 

T 2 + YfT 


2 97ra 3 

, “ 1YT 


By symmetry, or by direct integration, the first moment about a plane through the 
origin, perpendicular to the y-axis is also 


6 



13.9.3 THE CENTROID OF A SURFACE OF REVOLUTION 


Having calculated the first moment of a surface of revolution about a plane through the 
origin, perpendicular to the rr-axis, it is possible to determine a point, (x, 0), on the x-axis 
with the property that the first moment is given by Sx, where S is the total surface area. 

The point is called the “centroid” or the “geometric centre” of the surface of revolution 
and, for the surface of revolution of the arc of the curve whose equation is y = f(x), between 
x = a and x = b, the value of x is given by 


fa 27 rX V\ 

l 1 + 

(I)' d* 

f b 

Ja 

x'V' 

/ i+ 

(a;) 2 ^ 

fa 27 r V\ 

A+l 


r b 

Ja 

y\j 

'i+t 

;£) 2 ^ 


Note: 

The centroid effectively tries to concentrate the whole surface at a single point for the 
purposes of considering first moments. In practice, it corresponds to the position of the 
centre of mass of a thin sheet, for example, with uniform density. 

EXAMPLES 

1. Determine the position of the centroid of the surface of revolution (about the x-axis) of 
the arc of the circle whose equation is 

2,2 2 
x + y = a , 


lying in the first quadrant. 


7 




Solution 



From Example 1 of Section 13.9.2, we know that the first moment of the surface about 
a plane through the origin, perpendicular to the the x-axis is equal to n a 3 . 

Also, the total surface area is 

ra / /£ 2 pa 

/ 2ny\ l -\ — - dr = / 2ira dx = 2na 2 , 

Jo V V J o 

which implies that 

__ 7T a 3 a 

^ = W = 2’ 

2. Determine the position of the centroid of the surface of revolution (about the x-axis) of 
the Erst quadrant arc of the curve with parametric equations 


x = acos 3 0, y = asin'A. 



Solution 



We know from Example 2 of Section 13.9.2 that that the first moment of the surface 
about a plane through the origin, perpendicular to the x-axis is equal to 9na 

Also, the total surface area is given by 


128 


r o 

— / 27rasin 6.3a cos 6 sin 9 d6 — 

] \ 


3a 2 sin 4 9 cos 6 d9 = 3na 2 


sin 5 9 


3tt a 2 


Thus, 


15a 
' 128' 


9 



13.9.4 EXERCISES 


1. Determine the first moment, about a plane through the origin, perpendicular to the x- 
axis, of the surface of revolution (about the x-axis) of the straight-line segment joining 
the origin to the point (3,4). 

2. Determine the first moment about a plane through the origin, perpendicular to the 
x-axis, of the surface of revolution (about the x-axis) of the arc of the curve whose 
equation is 


y 2 = 4x, 


lying between x = 0 and x = 1. 

3. Determine the first moment about a plane through the origin, perpendicular to the 
y-axis, of the surface of revolution (about the y-axis) of the arc of the curve whose 
equation is 


V 2 = 4(x - 1), 


lying between y = 2 and y = 4. 

4. Determine the first moment, about a plane through the origin, perpendicular to the 
y-axis, of the surface of revolution (about the y-axis) of the arc of the curve whose 
parametricc equations are 


x = 2 cos t, y = 3 sin t, 
joining the point (2,0) to the point (0,3). 

5. Determine the position of the centroid of a hollow right-circular cone with height h. 

6. For the curve whose equation is 


9 y 2 = x(3 — x) 2 , 


show that 


dy 1 — x 
dx 2y / X 


Hence, show that the centroid of the surface obtained when the first quadrant arch of 
this curve is rotated through 27 t radians about the x-axis lies at the point oV 


10 



13.9.5 ANSWERS TO EXERCISES 


407T. 


2 . 


47T 


12\/2 4 

” 5 15 


~ 39.3 


8vr / y 

— i + 

5 I 4 


2 \ 2 


~ 41.98 


J 2 


4. 


47T 


^4 + 5cos 


x 31 
2A 2 


~ 47.75 


Jo 


5. Along the central axis, at a distance of 4^ from the vertex. 

6 . 

157 r 


First moment = 


Surface Area = 37 t. 
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UNIT 13.10 - INTEGRATION APPLICATIONS 10 


SECOND MOMENTS OF AN ARC 
13.10.1 INTRODUCTION 

Suppose that C denotes an arc (with length s) in the xy-plane of cartesian co-ordinates, and 
suppose that 5s is the length of a small element of this arc. 

Then the “second moment” of C about a fixed line, /, in the plane of C is given by 

lim h 2 5s, 

5s-> 0 ^ ’ 

where h is the perpendicular distance, from /, of the element with length 5s. 



13.10.2 THE SECOND MOMENT OF AN ARC ABOUT THE Y-AXIS 

Let us consider an arc of the curve whose equation is 


V = f(x), 


joining two points, P and Q, at x = a and x = b, respectively. 


1 



a 


6x b 


The arc may be divided up into small elements of typical length, 6s, by using neighbouring 
points along the arc, separated by typical distances of Sx (parallel to the x-axis) and 5y 
(parallel to the y-axis) . 

The second moment of each element about the y-axis is x 2 times the length of the element; 
that is, x 2 5s, implying that the total second moment of the arc about the y-axis is given by 

lim V x 2 6s. 

Ss^O V 


But, by Pythagoras’ Theorem, 


6s ~ \j (6x) 2 + ( 6y ) 2 




2 

Sx, 


so that the second moment of arc becomes 








dx. 


Note: 

If the curve is given parametrically by 


x = x{t), y = y(t ), 


2 



then, using the same principles as in Unit 13.4, we may conclude that the second moment 
of the arc about the y-axis is given by 


± 





df, 


according as is positive or negative. 

13.10.3 THE SECOND MOMENT OF AN ARC ABOUT THE X-AXIS 

(a) For an arc whose equation is 


V = f{x), 

contained between x = a and x — b, the second moment about the x-axis will be 





dx. 


Note: 

If the curve is given parametrically by 


x = x{t), y = y(t), 

then, using the same principles as in Unit 13.4, the second moment of the arc about the 
x-axis is given by 


± 





df, 


according as % is positive or negative. 


3 



(b) For an arc whose equation is 


x = g{y), 

contained between y = c and y = d, we may reverse the roles of x and y in section 13.10.2 
so that the second moment about the x-axis is given by 



Note: 

If the curve is given parametrically by 

x = x(t), y = y(t), 

then, using the same principles as in Unit 13.4, we may conclude that the second moment 
of the arc about the x-axis is given by 



according as 4^ is positive or negative and where t — U when y = c and t = t 2 when y — d. 


4 



EXAMPLES 


1. Determine the second moments about the x-axis and the y-axis of the arc of the circle 
whose equation is 


2,2 2 

x T y — a - 


lying in the first quadrant. 

Solution 



Using implicit differentiation, we have 

2x + 2 y^ = 0 
dx 

and hence, ^ 

’ ax y 

The second moment about the y-axis is therefore given by 



But x 2 + y 2 = a 2 and, hence, 


second moment 



ax 2 

dx. 

y 


5 



Making the substitution x = a sin u gives 


second moment = 


a 3 sin 2 ?j d u = a 3 


1 — cos 2 u 


d u = a 3 


u sin 2 u 

2 


Jo 


By symmetry, the second moment about the x-axis will also be 

2. Determine the second moments about the x-axis and the y - axis of the first quadrant 
arc of the curve with parametric equations 

x = acos 3 d, y = asin 3 d. 


Solution 



Firstly, we have 

— = — 3acos 2 0sind and — ^ = 3asin 2 d cos 9. 
dd d 0 

Hence, the second moment about the y-axis is given by 

— J x 2 \/ 9 a 2 cos 4 9 sin 2 9 + 9a 2 sin 4 6 l cos 2 d d 6, 
which, on using cos 2 d + sin 2 d = 1, becomes 

pi 

/ a 2 cos 6 9. 3a cos 9 sin 9 d 9 


6 


to|d 



= 3a 3 f " cos 7 6 sin 9 d# 
Jo 


= 3 a 2 



3a 3 


8 


Similarly, the second moment about the x-axis is given by 


\ 


— dd = / 2 a 2 sin 6 #.(3a cos 9 sin 9) dQ 
d 9 V d 9 / Jo 


= 3a 3 / 2 sin 7 # cos# d# = 3a 3 
Jo 


sin 8 # 


2 _ 3a 3 
jo ® 


though, again, this second result could be deduced, by symmetry, from the first. 


13.10.4 THE RADIUS OF GYRATION OF AN ARC 

Having calculated the second moment of an arc about a certain axis it is possible to determine 
a positive value, k, with the property that the second moment about the axis is given by 
sk 2 , where s is the total length of the arc. 

We simply divide the value of the second moment by s in order to obtain the value of k 2 
and, hence, the value of k. 

The value of k is called the “radius of gyration” of the given arc about the given axis. 

Note: 

The radius of gyration effectively tries to concentrate the whole arc at a single point for the 
purposes of considering second moments; but, unlike a centroid, this point has no specific 
location. 


7 



EXAMPLES 


1. Determine the radius of gyration, about the y- axis, of the arc of the circle whose equation 

is 


2,2 2 
x ~b y — a ■ 


lying in the first quadrant. 

Solution 



From Example 1 in Section 13.10.3, we know that the Second Moment of the arc about 

3 

the y - axis is equal to ^ . 

Also, the length of the arc is which implies that the radius of gyration is 

/ 7T a 3 2 a 

V 4 7ra y/2 

2. Determine the radius of gyration, about the y- axis, of the first quadrant arc of the curve 
with parametric equations 


x = acos 3 0, y = asin 3 #. 



Solution 



From Example 2 in Section 13.10.3, we know that 

— = — 3acos 2 d sin 6 and = 3asin 2 0 cos 6 
dd d 0 

and that the second moment of the arc about the y-axis is equal to 
Also, the length of the arc is given by 



Thus, the radius of gyration is 



9 



13.10.5 EXERCISES 


1. Determine the second moments about (a) the x-axis and (b) the y- axis of the straight 
line segment with equation 


y — 2x + 1, 


lying between x = 0 and x = 3. 

2. Determine the second moment about the y - axis of the first-quadrant arc of the curve 
whose equation is 


25 y 2 = 4x 5 , 


lying between x = 0 and x — 2. 

3. Determine, correct to two places of decimals, the second moment, about the x-axis, of 
the arc of the curve whose equation is 


V = e x , 

lying between x — 0.1 and x = 0.5. 

4. Given that 

n 1 

J \J x 2 — 1 dx = - [x\Jx 2 — 1 — ln(x + \J x 2 — 1)) + C, 

determine, correct to two places of decimals, the second moment, about the x-axis, of 
the arc of the curve whose equation is 

y 2 = 8x, 

lying between x = 0 and x — 1. 

5. Verify, using integration, that the radius of gyration, about the y-axis, of the straight 
line segment defined by the equation 


y = 3x + 2, 


from x = 0 to x = 1 is 


10 



6. Determine the radius of gyration about the x-axis of the arc of the circle given para- 
metrically by 

x = 5cos 9, y — 5 sin#, 

from 9 = 0 to 9 = 

7. For the curve whose equation is 

9 y 2 = x(3 — x) 2 , 

show that 

d y 1 — x 

dx 2y/x 

Hence, determine, correct to three significant figures, the radius of gyration, about the 
y- axis, of the first quadrant arch of this curve. 
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UNIT 13.11 - INTEGRATION APPLICATIONS 11 


SECOND MOMENTS OF AN AREA (A) 

13.11.1 INTRODUCTION 

Suppose that R denotes a region (with area A) of the xy-plane in cartesian co-ordinates, and 
suppose that 5 A is the area of a small element of this region. 

Then the “second moment” of R about a fixed line, /, not necessarily in the plane of 

R, is given by 


lim 'Y' h 2 SA , 

R 


where h is the perpendicular distance from / of the element with area, 5 A. 



13.11.2 THE SECOND MOMENT OF AN AREA ABOUT THE Y-AXIS 

Let us consider a region in the first quadrant of the xy-plane bounded by the x-axis, the 
lines x — a, x — b and the curve whose equation is 


V = f{x). 


1 



a 


5x b 


The region may be divided up into small elements by using a network consisting of neigh- 
bouring lines parallel to the y-axis and neighbouring lines parallel to the x-axis. 

But all of the elements in a narrow ‘strip’, of width 5x and height y (parallel to the y-axis), 
have the same perpendicular distance, x, from the y-axis. 

Hence the second moment of this strip about the y-axis is x 2 times the area of the strip; that 
is, x 2 (ydx), implying that the total second moment of the region about the y-axis is given 
by 


lim 

>0 


x=b 

Y x 2 y5x 

x =a 



x 2 y da;. 


Note: 

Second moments about the axaxis will be discussed mainly in the next section of this Unit; 
but we note that, for a region of the first quadrant, bounded by the y-axis, the lines y = c, 
y — d and the curve whose equation is 


x = g(y), 


we may reverse the roles of x and y so that the second moment about the x-axis is given by 



y 2 x dy. 


2 




EXAMPLES 

1. Determine the second moment of a rectangular region with sides of lengths, a and b, 
about the side of length b. 

Solution 



3 



2. Determine the second moment about the y-axis of the semi-circular region, bounded in 
the first and fourth quadrants, by the y-axis and the circle whose equation is 

2,2 2 

x + y = a . 


Solution 



Since there will be equal contributions from the upper and lower halves of the region, 
the second moment about the y-axis is given by 


2 [ x 2 * V a 2 — x 2 dx = 2 I a 2 sin 2 #.a cos 9. a cos 9d9, 
Jo Jo 

if we substitute x = a sin#. 

This simplifies to 


2a 4 



sin 2 2# 

4 


d 9 


a 4 1 — cos 4# 

— / d# 

2 jo 2 


a 

T 


9- 


sin 4 9 


7i a 


13.11.3 THE SECOND MOMENT OF AN AREA ABOUT THE X-AXIS 

In the first example of the previous section, a formula was established for the second moment 
of a rectangular region about one of its sides. This result may now be used to determine the 
second moment about the x-axis, of a region enclosed, in the first quadrant, by the a;-axis, 
the lines x = a, x = b and the curve whose equation is y = f(x). 


4 



a 


5x b 


If a narrow strip of width Sx and height y is regarded, approximately, as a rectangle, its 
second moment about the x-axis is | y 3 5x . Hence the second moment of the whole region 
about the x-axis is given by 


lim 

>0 


x ~b 1 r>b I 

53 oV 35x = / ^y 3 dx ■ 

x=a ^ ** ° ^ 


EXAMPLES 


1. Determine the second moment about the x-axis of the region bounded, in the first 
quadrant, by the x-axis, the y-axis, the line x = 1 and the line whose equation is 

y — x + 1. 


5 



Solution 



2. Determine the second moment about the x-axis of the region, bounded in the first 
quadrant, by the x-axis, the y- axis, the line x = 1 and the curve 

y = xe x . 


6 



Solution 



Second moment 



-x 3 e 3x 

3 


da; 


1 

3 




1 

3 




That is, 



+ 


2xe 3x 

9 




1 

3 




2xe 3x 

9 


2e 3x l 1 


27 


J o 


4e 3 + 2 
81 


~ 1.02 


Note: 

The Second Moment of an area about a certain axis is closely related to its “moment of 
inertia” about that axis. In fact, for a thin plate with uniform density, p, the moment of 
inertia is p times the second moment of area, since multiplication by p, of elements of area, 
converts them into elements of mass. 
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13.11.7 EXERCISES 


Determine the second moment of each of the following regions of the xy-plane about the axis 
specified: 

1. Bounded in the first quadrant by the x-axis, the y-axis and the curve whose equation is 

y = 1 — 2x 2 . 

Axis: The y-axis. 

2. Bounded in the first quadrant by the x-axis and the curve whose equation is 

y = sinx. 

Axis: The x-axis. 

3. Bounded in the first quadrant by the x-axis, the y-axis, the line x = 1 and the curve 
whose equation is 

V = e- 2 A 

Axis: The x-axis 

4. Bounded in the first quadrant by the x-axis, the y-axis, the line x = 1 and the curve 
whose equation is 

V = e- 2 A 

Axis: The y-axis. 


13.11.8 ANSWERS TO EXERCISES 

1 . 

V2 

30 ' 

2 . 

4 

9 

3. 

0.055, approximately. 


4 . 


0.083, approximately. 
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UNIT 13.12 - INTEGRATION APPLICATIONS 12 


SECOND MOMENTS OF AN AREA (B) 

13.12.1 THE PARALLEL AXIS THEOREM 

Suppose that M g denotes the second moment of a given region, R, about an axis, g , through 
its centroid. 

Suppose also that Mi denotes the second moment of R about an axis, /, which is parallel to 
the first axis, in the same plane as R and having a perpendicular distance of d from the first 
axis. 



We have 

Mi = ( h + d ) 28A = E ( h2 + 2hd + A 

R R 

That is, 


M l = Y, h 2 $ A + 2d E h8A + d 2 E 8A = M a + Ad 2 > 

R R R 

since the summation, X)r hSA, is the first moment about the an axis through the centroid 
and therefore zero; (see Unit 13.7, section 13.7.4). 


1 



The Parallel Axis Theorem states that 

M t = M g + Ad 2 . 

EXAMPLES 

1. Determine the second moment of a rectangular region about an axis through its centroid, 
parallel to one side. 

Solution 



For a rectangular region with sides of length a and b, the second moment about the side 
of length b is ^ from Example 1 in the previous Unit, section 13.11.2. 

The perpendicular distance between the two axes is then |, so that the required second 
moment, M g is given by 

a 3 b , /a\ 2 , , a 3 b 

— = M g + aby-^J =M 9 + ^~ 

Hence, 



2. Determine the second moment of a semi-circular region about an axis through its cen- 

troid, parallel to its diameter. 


2 



Solution 



4 

The second moment of the semi-circular region about its diameter is from Example 
2 in the previous Unit, section 13.11.2. 

Also the position of the centroid, from Example 2 in Unit 13.7, section 13.7.4, is a 
distance of || from the diameter, along the radius which perpendicular to it. 

Hence, 


7r a 4 
~ 8 ~ 




Mg + 


8a 4 

9n 2 


That is, 


Mg 


7T a 4 8a 4 
~8~ ~ 9k 2 ' 


13.12.2 THE PERPENDICULAR AXIS THEOREM 

Suppose l\ and I 2 are two straight lines, at right-angles to each other, in the plane of a region 
R with area A and suppose h\ and /12 are the perpendicular distances from these two lines, 
respectively, of an element 5 A in R. 


3 



5A 


The second moment about l\ is given by 

M x = £ h\8A 

R 

and the second moment about l 2 is given by 

m 2 = Y j h l SA - 

R 

Adding these two together gives the second moment about an axis, perpendicular to the 
plane of R and passing through the point of intersection of l\ and l 2 . This is because the 
square of the perpendicular distance, /i 3 , of 5 A from this new axis is given, from Pythagoras’s 
Theorem, by 

/r 3 — h j -f- h 2 - 

EXAMPLES 

1. Determine the second moment of a rectangular region, R, with sides of length a and b, 
about an axis through one corner, perpendicular to the plane of R. 



4 



Solution 


y 



Using Example 1 in the previous Unit, section 13.11.2, the required 
second moment is 

111 

-a 3 b + -b 3 a = - ab(a 2 + b 2 ). 

o o o 

2. Determine the second moment of a circular region, R, with radius a, about an axis 
through its centre, perpendicular to the plane of R. 

Solution 


y 



The second moment of R about a diameter is, from Example 2 in the previous Unit, 
section 13.11.2, equal to that is, twice the value of the second moment of a semi- 
circular region about its diameter. 


5 



The required second moment is thus 

4 4 4 

7 ra 7 ra tt a 

~T + ^r = ~y ' 

13.12.3 THE RADIUS OF GYRATION OF AN AREA 

Having calculated the second moment of a two dimensional region about a certain axis it is 
possible to determine a positive value, k, with the property that the second moment about 
the axis is given by Ak 2 , where A is the total area of the region. 

We simply divide the value of the second moment by A in order to obtain the value of k 2 
and hence the value of k. 

The value of k is called the “radius of gyration” of the given region about the given axis. 

Note: 

The radius of gyration effectively tries to concentrate the whole area at a single point for the 
purposes of considering second moments; but, unlike a centroid, this point has no specific 
location. 

EXAMPLES 

1. Determine the radius of gyration of a rectangular region, R, with sides of lengths a and 
b about an axis through one corner, perpendicular to the plane of R. 

Solution 


y 


b 


& 


R 


a 


x 


6 



Using Example 1 from the previous section, the second moment is 

- ab(a 2 + b 2 ) 

3 

and, since the area itself is ab, we obtain 

k = V a 2 + b 2 . 

2. Determine the radius of gyration of a circular region, R, about an axis through its 
centre, perpendicular to the plane of R. 

Solution 



4 

From Example 2 in the previous section, the second moment about the given axis is 
and, since the area itself is 7ra 2 , we obtain 


k = 


a 

T* 


7 



13.12.4 EXERCISES 


Determine the radius of gyration of each of the following regions of the xy-plane about the 
axis specified: 

1. Bounded in the first quadrant by the a;- axis, the y-axis and the lines x — a, y — b. 

Axis: Through the point perpendicular to the xy-plane. 

2. Bounded in the hrst quadrant by the x-axis, the y-axis and the lines x = a, y = b. 
Axis: The line x = 4a. 

3. Bounded in the first quadrant by the x-axis, the y-axis and the curve whose equation is 

2,2 2 

x + y = a . 

Axis: Through the origin, perpendicular to the xy-plane. 

4. Bounded in the hrst quadrant by the x-axis, the y-axis and the curve whose equation is 

2,2 2 

x + y — a . 

Axis: The line x = a. 

13.12.5 ANSWERS TO EXERCISES 

1 . 



2 . 

7 a 

vr 

3. 

a 

vr 

4. 

oa/5 
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UNIT 13.13 - INTEGRATION APPLICATIONS 13 


SECOND MOMENTS OF A VOLUME (A) 

13.13.1 INTRODUCTION 

Suppose that R denotes a region (with volume V) in space and suppose that 5V is the 
volume of a small element of this region. 

Then the “second moment” of R about a fixed line, /, is given by 


lim y h 2 6V, 
sv-^o V 

it 


where h is the perpendicular distance from / of the element with volume 8V. 



EXAMPLE 

Determine the second moment, about its own axis, of a solid right-circular cylinder with 
height, h, and radius, a. 


1 



Solution 



In a thin cylindrical shell with internal radius, r, and thickness, Sr, all of the elements of 
volume have the same perpendicular distance, r, from the axis of moments. 

Hence the second moment of this shell will be the product of its volume and r 2 ; that is, 
r 2 (2nrhSr) . 

The total second moment is therefore given by 


lim 'S'' r 2 (2nrhSr) = / 2n hr 3 dr = . 

, r=0 Jo 2 

13.13.2 THE SECOND MOMENT OF A VOLUME OF REVOLUTION ABOUT 
THE Y-AXIS 

Let us consider a region in the first quadrant of the xy- plane, bounded by the x-axis, the 
lines x = a, x = b and the curve whose equation is 


V = f{x). 


2 




a 


Sx b 


The volume of revolution of a narrow ‘strip’, of width, Sx, and height, y, (parallel to the 
y-axis) , is a cylindrical ‘shell’, of internal radius x, height, y, and thickness, Sx. 

Hence, from the example in the previous section, its second moment about the y-axis is 
27 rx 3 ySx. 

Thus, the total second moment about the y-axis is given by 

x =h 

lim 2n x 3 ySx = / 2nx 3 y dx. 

5x—>0 z ^ Jo 

x=a u 


Note: 

Second moments about the x-axis will be discussed mainly in the next section of this Unit; 
but we note that, for the volume of revolution, about the x-axis, of a region in the first 
quadrant bounded by the y-axis, the lines y = c, y = d and the curve whose equation is 


x = g{y), 


we may reverse the roles of x and y so that the second moment about the x-axis is given by 



27ry 3 x dy. 


3 




EXAMPLE 

Determine the second moment, about a diameter, of a solid sphere with radius a. 

Solution 

We may consider, first, the volume of revolution about the y-axis of the region, bounded in 
the first quadrant, by the a;- axis, the y-axis and the circle whose equation is 

2,2 2 
x ~\~ y — ci , 

then double the result obtained. 

y 


x 



4 



The total second moment is given by 


2 [ 2nx 3 \/ a 2 — x 2 dx = An f 2 a 3 sin 3 #.acos#.acos#d#, 
Jo Jo 

if we substitute x = a sin#. 

This simplifies to 


47ra 5 f 2 sin 3 # cos 2 #d# — An f 2 (cos 2 # — cos 4 #) sin #d#, 


if we make use of the trigonometric identity 


sin 2 # = 1 — cos^#. 


The total second moment is now given by 


47ra 5 


cos 3 # 

3 


+ 


cos 5 # 



8n a 5 
~ 15 - ' 


13.13.3 THE SECOND MOMENT OF A VOLUME OF REVOLUTION ABOUT 
THE X-AXIS 

In the introduction to this Unit, a formula was established for the second moment of a solid 
right-circular cylinder about its own axis. This result may now be used to determine the 
second moment about the x-axis for the volume of revolution about this axis of a region 
enclosed in the first quadrant by the x-axis, the lines x = a, x = b and the curve whose 
equation is 


V = f(x). 


5 



a 


5x b 


The volume of revolution about the x-axis of a narrow strip, of width 5x and height y, is a 

4 c 

cylindrical ‘disc’ whose second moment about the x-axis is wy 0 Hence the second moment 
of the whole region about the x-axis is given by 


x=b 





4 

ny 

2 


dx. 


EXAMPLE 

Determine the second moment about the x-axis for the volume of revolution about this axis 
of the region bounded in the first quadrant by the x-axis, the y-axis, the line x = 1 and the 
line whose equation is 

y = x + 1. 
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Solution 



Second moment 



7r(a; + l) 4 
2 


da; 


7T- 


(x + l) 4 

10 


31tt 

To”' 


Note: 

The second moment of a volume about a certain axis is closely related to its “moment of 
inertia” about that axis. In fact, for a solid, with uniform density, p, the Moment of Inertia 
is p times the second moment of volume, since multiplication by p of elements of volume 
converts them into elements of mass. 

13.13.4 EXERCISES 


1. Determine the second moment about a diameter of a circular disc with small thickness, 
t, and radius, a. 

2. Determine the second moment, about the axis specified, for the volume of revolution of 
each of the following regions of the a;p-plane about this axis: 


(a) Bounded in the first quadrant by the a;-axis, the p-axis and the curve whose equation 

is 


y — 1 — 2x 2 . 


Axis: The p-axis. 

(b) Bounded in the first quadrant by the a;-axis and the curve whose equation is 


Axis: The rr-axis. 


y 2 = sin a;. 
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(c) Bounded in the first quadrant by the x-axis, the y-axis, the line x — 1 and the 
curve whose equation is 


y 


= e 


-2x 


Axis: The x-axis 

(d) Bounded in the first quadrant by the x-axis, the y-axis, the line x = 1 and the 
curve whose equation is 


y 


= e 


-2x 


Axis: The y-axis. 


13.13.5 ANSWERS TO EXERCISES 


1 . 

naH 

~1T' 

2. (a) 

71 

24' 

(b) 

7T 2 

T' 


0.196, approximately. 

(d) 


0.337, approximately. 
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UNIT 13.14 - INTEGRATION APPLICATIONS 14 


SECOND MOMENTS OF A VOLUME (B) 

13.14.1 THE PARALLEL AXIS THEOREM 

Suppose that M g denotes the second moment of a given region, R, about an axis, g , through 
its centroid. 

Suppose also that Mi denotes the second moment of R about an axis, /, which is parallel to 
the first axis and has a perpendicular distance of d from the first axis. 



In the above three-dimensional diagram, we have 

M t = Y r 2 5V and M g = Y r 2 g 8V. 

R R 


But, from the Cosine Rule, 


r 2 = r 2 + d 2 — 2r 9 dcos(180° — 6) = r 2 + d 2 + 2r g dcos9. 


Hence, 


r 2 = r 2 + d 2 + 2 dh 


1 



and so 


£ r 2 5V = £ r 2 g SV + £ d 2 <SU + 2 MU. 


Finally, the expression 


£ MU 

R 

represents the first moment of R about a plane through the centroid, which is perpendicular 
to the plane containing l and g. Such first moment will be zero and hence, 

M x = M g + Vd 2 . 


EXAMPLE 

Determine the second moment of a solid right-circular cylinder about one of its generators 
(that is, a line in the surface, parallel to the central axis). 

Solution 



The second moment of the cylinder about the central axis was shown, in Unit 13.13, section 
13.13.2, to be and, since this axis and the generator are a distance a apart, the required 
second moment is given by 


7T a A h 
2 


( na 2 h)a 2 


3n a A h 
2 


2 




13.14.2 THE RADIUS OF GYRATION OF A VOLUME 


Having calculated the second moment of a three-dimensional region about a certain axis, it 
is possible to determine a positive value, k, with the property that the second moment about 
the axis is given by Vk 2 , where V is the total volume of the region. 

We simply divide the value of the second moment by V in order to obtain the value of k 2 
and hence, the value of k. 

The value of k is called the “radius of gyration” of the given region about the given axis. 

Note: 

The radius of gyration effectively tries to concentrate the whole volume at a single point 
for the purposes of considering second moments; but, unlike a centroid, this point has no 
specific location. 

EXAMPLES 

1. Determine the radius of gyration of a solid right-circular cylinder with height, h, and 
radius, a, about (a) its own axis and (b) one of its generators. 

Solution 



Using earlier examples, together with the volume, V = na 2 h, the required radii of 
gyration are (a) \J 4- ira 2 h = ^ and (b) 4. -ji a 2 h = a^J\. 

2. Determine the radius of gyration of the volume of revolution about the x-axis of the 
region, bounded in the first quadrant by the x-axis, the y-axis, the line x = 1 and the 
line whose equation is 

y = x + 1. 


3 



Solution 



From Unit 13.13, section 13.13.3, the second moment about the given axis is 
The volume itself is given by 


[ 7r(x+l )" 
Jo 


da; 


( x + 1) 3 1 1 

TT 


7tt 

~3~' 


Hence, 


, 2 3l7r 3 93 

k = x — = — . 

10 7tt 70 


That is, 


k = 



1.15 


317 r 
10 ’ 
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13.14.3 EXERCISES 


1. Determine the radius of gyration of a hollow cylinder with internal radius, a, and 
external radius, b , about 

(a) its central axis; 

(b) a generator lying in its outer surface. 

2. Determine the radius of gyration of a solid hemisphere, with radius a, about 

(a) its base-diameter; 

(b) an axis through its centroid, parallel to its base-diameter. 

3. For a solid right-circular cylinder with height, h, and radius, a, determine the radius of 
gyration about 

(a) a diameter of one end; 

(b) an axis through the centroid, perpendicular to the axis of the cylinder. 

4. For a solid right-circular cone with height, h, and base-radius, a, determine the radius 
of gyration about 

(a) the axis of the cone; 

(b) a line through the vertex, perpendicular to the axis of the cone; 

(c) a line through the centroid, perpendicular to the axis of the cone. 


5 



13.14.4 ANSWERS TO EXERCISES 



6 
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UNIT 13.15 - INTEGRATION APPLICATIONS 15 


SECOND MOMENTS OF A SURFACE OF REVOLUTION 

13.15.1 INTRODUCTION 

Suppose that C denotes an arc (with length s) in the xy-plane of cartesian co-ordinates, and 
suppose that 5s is the length of a small element of this arc. 

Then, for the surface obtained when the arc is rotated through 2n radians about the x-axis, 
the “second moment” about the x-axis, is given by 

lim y 2 .27ry5s. 

8 s — >0 

13.15.2 INTEGRATION FORMULAE FOR SECOND MOMENTS 

(a) Let us consider an arc of the curve whose equation is 


V = f 0), 

joining two points, P and Q, at x = a and x = 6, respectively. 



The arc may be divided up into small elements of typical length, 5s , by using neighbouring 
points along the arc, separated by typical distances of 5x (parallel to the x-axis) and 5y 
(parallel to the y-axis) . 
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From Pythagoras’ Theorem, 



so that, for the surface of revolution of the arc about the a;- axis, the second moment becomes 



Note: 

If the curve is given parametrically by 

x = x(t), y = y(t ), 

then 

d9 = t 

A y dx ' 

UX d t 

Hence, 


d t 

provided that is positive on the arc being considered. If not, then the above line needs 
to be prefixed by a negative sign. 

From the technique of integration by substitution, 



where t = ti when x = a and t = t 2 when x = b. 




2 




We may conclude that the second moment about the plane through the origin, perpendicular 
to the a;-axis, is given by 


second moment 



dt 


according as is positive or negative, 
(b) For an arc whose equation is 


x = g{y), 


contained between y = c and y = d, we may reverse the roles of x and y in the previous 
section so that the second moment about the y-axis is given by 




2 


d y 



Note: 

If the curve is given parametrically by 


x = x{t), y = y(t), 


3 



where t = t\ when y = c and t — t ,2 when y = d, then the second moment about the y-axis 
is given by 


second moment 






dt, 


according as is positive or negative. 

EXAMPLES 

1. Determine the second moment about the x-axis of the hemispherical surface of revolu- 
tion (about the x-axis) of the arc of the circle whose equation is 

2,2 2 

x d - y — ( i ■ 

lying in the first quadrant. 

Solution 



Using implicit differentiation, we have 


2x + 2 y 


d y 

dx 


0 


and, hence, ^ 

1 dx y 


4 



The second moment about the a;-axis is therefore given by 


ra / rp 2 pa / 

L 2 ny 3 f + s dx= l 


3 x 2 + y 2 


y 


da;. 


But x 2 + y 2 = a 2 , and so the second moment becomes 


ra 

/ 27t a(a 2 — x 2 ) dx = 2na 

Jo 


rp 

O <X/ 

a x ~ y 


47ra 4 


2. Determine the second moment about the axis of revolution, when the first quadrant 
of the curve with parametric equations 

x = acos 3 d, y = asin 3 # 


is rotated through 27 t radians about (a) the x-axis and (b) the y-axis. 

Solution 



(a) Firstly, we have 


— = — 3acos 2 dsind and = 3asin 2 0cosd. 
d 6 d 0 


Hence, the second moment about the x-axis is given by 


— j 2iry 3 \/9a 2 cos‘ l dsm 2 d + 9a 2 sin 4 6*cos 2 6* c!6>, 


which, on using cos 2 6 + sin 2 d = 1, becomes 


arc 


5 



27ra 3 sin 2 ' 9.3a cos 9 sin 6 d9 = / 67ra 4 sin 28 # cos # dO 


= 6na 4 / ~ sin 28 # cos 0 d#, 


which, by the methods of Unit 12.7 gives 

„ q ,"sin 29 # 
bn a 


29 


o 


67ra 4 

” 29 ” 


(b) By symmetry, or by direct integration, the second moment about the y - axis is also 

67ra 4 
29 ' 


13.15.3 THE RADIUS OF GYRATION OF A SURFACE OF REVOLUTION 

Having calculated the second moment of a surface of revolution about a specified axis, it is 
possible to determine a positive value, k , with the property that the second moment about 
the axis is given by Sk 2 , where S is the total surface area of revolution. 

We simply divide the value of the second moment by S in order to obtain the value of k 2 
and hence the value of k. 

The value of k is called the “radius of gyration” of the given arc about the given axis. 

Note: 

The radius of gyration effectively tries to concentrate the whole surface at a single point 
for the purposes of considering second moments; but, unlike a centroid, this point has no 
specific location. 

EXAMPLES 


1. Determine the radius of gyration about the x-axis of the surface of revolution (about 
the x-axis) of the arc of the circle whose equation is 


2,2 2 
x y — a . 


lying in the first quadrant. 
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Solution 



From an Example 1 in section 13.15.2, we know that the second moment of the surface 
about the x-axis is equal to 

Also, the total surface area is 


ra / % 2 pa 

/ 2ny\ 1 H — - da: = / 2na dx = 2n a 2 , 

Jo V V J o 


which implies that 


, o 47ra 4 1 2 a 2 

h = x = . 

3 27ra 2 3 


The radius of gyration is thus given by 


k = a 

2. Determine the radius of gyration about the axaxis of the surface of revolution (about 
the axaxis) of the first quadrant arc of the curve with parametric equations 

x = acos 3 d, y = as in 3 (9. 
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Solution 



We know from Example 2 in section 13.15.2, that that the second moment of the surface 
about the x-axis is equal to 67ra 


29 • 


Also, the total surface area is given by 


— / 27rasiiT 1 6 ) .3acos6 , sin0 d9 = I 3a 2 sin 4 # cos 6 dO = 3na 


sin 5 6' 


3tt a 2 


Thus, 


k 2 = 


67 ra 4 


x 


5 10a 2 


29 3vra 2 29 


13.15.4 EXERCISES 

1. Determine the second moment, about the x-axis, of the surface of revolution (about the 
x-axis) of the straight-line segment joining the origin to the point (2, 3). 

2. Determine the second moment about the x-axis, of the surface of revolution (about the 
x-axis) of the first quadrant arc of the curve whose equation is y 2 = 4x, lying between 
x = 0 and x = 1. 

3. Determine, correct to two places of decimals, the second moment about the y-axis, of 

the surface of revolution (about the y-axis) of the first quadrant arc of the curve whose 

equation is 3 y = x 3 , lying between x = 1 and x = 2. 



4. Determine, correct to two places of decimals, the second moment, about the y- axis, of 
the surface of revolution (about the y- axis) of the arc of the circle given parametrically 
by 

x = 2 cos t, y = 2 sin t, 
joining the point (y/2, y/2) to the point (0, 2). 

5. Determine the radius of gyration of a hollow right-circular cone with maximum radius, 
a, about its central axis. 

6. For the curve whose equation is 9 y 2 = x(3 — x) 2 , show that 

d y 1 — x 

dx 2y / x 

Hence, show that the radius of gyration about the y-axis of the surface obtained when 
the first quadrant arch of this curve is rotated through 27r radians about the x-axis is 
4, correct to the nearest whole number. 


13.15.5 ANSWERS TO EXERCISES 


1 . 


2 . 


7t27\/T3 
2 ' 

— [4- y/2] ~ 51.99 


3. 

4. 


70.44 


0.73 


5. 


6 . 


k = 


a 

V2' 


Second moment ~ 139.92, surface area ~ 9.42 
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UNIT 13.16 - INTEGRATION APPLICATIONS 16 


CENTRES OF PRESSURE 

13.16.1 THE PRESSURE AT A POINT IN A LIQUID 

In the following diagram, we consider the pressure in a liquid at a point, P, whose depth 
below the surface of the liquid is z. 


Surface 


z 


P 


Ignoring atmospheric pressure, the pressure, p, at P is measured as the thrust acting upon 
unit area and is due to the weight of the column of liquid with height z above it. 

Hence, 


p = wz 

where w is the weight, per unit volume, of the liquid. 

Note: 

The pressure at P is directly proportional to the depth of P below the surface; and we shall 
assume that the pressure acts equally in all directions at P. 
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13.16.2 THE PRESSURE ON AN IMMERSED PLATE 


We now consider a rectangular plate, with dimensions a and (ci 2 — hi), immersed vertically 
in a liquid as shown below. 


Surface 


d\ 



For a thin strip, PQ, of width, 6z , at a depth, z, below the surface of the liquid, the thrust 
on PQ will be the pressure at P multiplied by the area of the strip; that is, wz x adz. 

The total thrust on the whole plate will therefore be 


z=d 2 

y wazdz. 

z=d\ 


Allowing Sz to tend to zero, the total thrust becomes 


rd,2 

waz 2 

/ waz d z = 


Idi 

2 




This may be written 


wa(d 2 



5 


where, in this form, a(d 2 — d\) is the area of the plate and d2 + dl is the depth of the centroid 
of the plate. 


2 



We conclude that 


Total Thrust = Area of Plate x Pressure at the Centroid. 


Note: 

It may be shown that this result holds whatever the shape of the plate is; and even when 
the plate is not vertical. 

EXAMPLES 

1. A triangular plate is immersed vertically in a liquid for which the weight per unit volume 
is w. The dimensions of the plate and its position in the liquid is shown in the following 
diagram: 


Surface 



Determine the total thrust on the plate as a multiple of w. 

Solution 

The area of the plate is given by 


Area = - x 4 x 2 = 4cm 2 
2 

The centroid of the plate, which is at a distance from its horizontal side equal to one 
third of its perpendicular height, will lie at a depth of 


5 

1 H — x 2 ) cms. = -cms. 


Hence, the pressure at the centroid is ^ and we conclude that 


_ _ , , 5w 2C )w 

total thrust = 4 x — = . 

3 3 
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2. The following diagram shows a rectangular plate immersed in a liquid for which the 
weight per unit volume is w\ and the plate is inclined at 30° to the horizontal: 


Surface Surface 



Determine the total thrust on the plate as a multiple of w. 

Solution 

The depth, z, of the centroid, G, of the plate is given by 

z — di + - sin 30° = di + -. 
Hence, the pressure, p, at G is given by 



and, since the area of the plate is ab , we obtain 


Total thrust 


ab 



w. 
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13.16.3 THE DEPTH OF THE CENTRE OF PRESSURE 


In this section, we consider again an earlier diagram for a rectangular plate, immersed 
vertically in a liquid whose weight per unit volume is w. 


Surface 


d\ 



We have already seen that the total thrust on the plate is 



waz d z = w 



az dz 


and is the resultant of varying thrusts, acting according to depth, at each level of the plate. 

But, by taking first moments of these thrusts about the line in which the plane of the plate 
intersects the surface of the liquid, we may determine a particular depth at which the total 
thrust may be considered to act. 

This depth is called “the depth of the centre of pressure”. 

The Calculation 

In the diagram, the thrust on the strip PQ is wazSz and its first moment about the line in 
the surface is waz 2 8z so that the sum of the first moments on all such strips is given by 


z=d 2 

Y, waz 2 8z = w 

z=d\ 


rd 2 


] d 1 


az 2 dz 


where the definite integral is, in fact, the second moment of the plate about the line in the 
surface. 
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Next, we define the depth, C p , of the centre of pressure to be such that 


Total thrust x C p = sum of first moments of strips like PQ. 


That is, 


and, hence, 


r d 2 rd 2 

w az dz x C p = w az d z 
Jd i Jd 1 


az 2 d; 

" /* 2 a* d* ’ 

which may be interpreted as 

_Ak 2 _ k 2 


where A is the area of the plate, k is the radius of gyration of the plate about the line in the 
surface of the liquid and z is the depth of the centroid of the plate. 

Notes: 

(i) It may be shown that the formula 


a 


e 

z 


holds for any shape of plate immersed at any angle. 

(ii) The phrase, “centre of pressure” suggests a particular point at which the total thrust is 
considered to act; but this is simply for convenience. The calculation is only for the depth 
of the centre of pressure. 
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EXAMPLE 


Determine the depth of the centre of pressure for the second example of the previous section. 

Solution 



The depth of the centroid is 


d\ + 


b 

4 


and the square of the radius of gyration of the plate about an axis through the centroid, 

2 

parallel to the side with length a is 

Furthermore, the perpendicular distance between this axis and the line of intersection of the 
plane of the plate with the surface of the liquid is 


b d\ 

2 + sin 30° 


b 

2 


+ 2d\. 


ffence, the square of the radius of gyration of the plate about the line in the surface is 


12 


+ 




2 


using the Theorem of Parallel Axes. 
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Finally, the depth of the centre of pressure is given by 


_ 12 + (I + 2rfl ) 

^1 + | 


13.16.4 EXERCISES 

1. A thin equilateral triangular plate is immersed vertically in a liquid for which the 
weight per unit volume is w, with one edge on the surface. If the length of each side is 
a, determine the total thrust on the plate. 

2. A thin plate is bounded by the arc of a parabola and a straight line segment of length 
1.2m perpendicular to the axis of symmetry of the parabola, this axis being of length 
0.4m. 

If the plate is immersed vertically in a liquid with the straight edge on the surface, 
determine the total thrust on the plate in the form Iw, where w is the weight per unit 
volume of the liquid and / is in decimals, correct to two places. 

3. A thin rectangular plate, with sides of length 10cm and 20cm is immersed in a liquid so 
that the sides of length 10cm are horizontal and the sides of length 20cm are incline at 
55° to the horizontal. If the uppermost side of the plate is at a depth of 13cm, determine 
the total thrust on then plate in the form Iw , where w is the mass per unit volume of 
the liquid. 

4. A thin circular plate, with diameter 0.5m is immersed vertically in a tank of liquid so 
that the uppermost point on its circumference is 2m below the surface. Determine the 
depth of the centre of pressure, correct to two places of decimals. 

5. A thin plate is in the form of a trapezium with parallel sides of length lm and 2.5m, a 
distance of 0.75m apart, and the remaining two sides inclined equally to either one of 
the parallel sides. 

If the plate is immersed vertically in water with the side of length 2.5m on the surface, 
calculate the depth of the centre of pressure, correct to two places of decimals. 



13.16.5 ANSWERS TO EXERCISES 


1 . 


Total thrust 


wa 3 


2 . 

3. 

4. 


Total Thrust = 5.12-tu. 
C p ~ 2.26m. 

C p ~ 0.46m. 
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UNIT 14.1 - PARTIAL DIFFERENTIATION 1 - PARTIAL DERIVATIVES OF 
THE FIRST ORDER 

14.1.1 FUNCTIONS OF SEVERAL VARIABLES 

In most scientific problems, it is likely that a variable quantity under investigation will 
depend (for its values), not only on one other variable quantity, but on several other 
variable quantities. 

The type of notation used may be indicated by examples such as the following: 

1 . 


* = f(x,y), 

which means that the variable, z, depends (for its values) on two variables, x and y. 

2 . 


w = F(x,y,z), 

which means that the variable, w, depends (for its values) on three variables, x, y and 

z. 

Normally, the variables on the right-hand side of examples like those above may be 
chosen independently of one another and, as such, are called the “independent vari- 
ables”. By contrast, the variable on the left-hand side is called the “dependent 
variable” . 


Notes: 

(i) Some relationships between several variables are not stated as an explicit formula for 
one of the variables in terms of the others. 

An illustration of this type would be x 2 + y 2 + z 2 = 16. 

In such cases, it may be necessary to specify separately which is the dependent variable. 

(ii) The variables on the right-hand side of an explicit formula, giving a dependent variable 
in terms of them, may not actually be independent of one another. This would occur if those 
variables were already, themselves, dependent on a quantity not specifically mentioned in 
the formula. 
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For example, in the formula 


z = xy 2 + sin(x — y), 

suppose it is also known that x — t — 1 and y = 3t + 2. 

Then the variables x and y are not independent of each other. In fact, by eliminating t, we 
obtain 


y = 3(x + 1) + 2 = 3a; + 5. 

14.1.2 THE DEFINITION OF A PARTIAL DERIVATIVE 
ILLUSTRATION 

Consider the formulae for the volume, V, and the surface area, S, of a solid right- circular 
cylinder with radius, r, and height, h. 


! \r\ 


l / 

\ i 
\ I 
\ / 
\/ 
v 


hr 


The relevant formulae are 


V = irr 2 h and S = 2irr 2 + 2n rh, 
so that both V and S are functions of the two variables, r and h. 

But suppose it were possible for r to be held constant while h is allowed to vary. Then the 
corresponding rates of increase of V and S with respect to h are given by 


dV 

d h 


= 7T r 


r const. 
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and 


dS 

dh 


= 2nr. 


r const. 


These two expressions are called the “partial derivatives of V and S with respect to 

h”. 

Similarly, suppose it were possible for h to be held constant while r is allowed to vary. Then 
the corresponding rates of increase of V and S with respect to r are given by 


dV 

dr 


= 2-7T rh 


h const. 


and 


dS 


dr 


J h const. 


47 rr + 2nh. 


These two expressions are called the “partial derivatives of V and S with respect to 

r ii 

THE NOTATION FOR PARTIAL DERIVATIVES 

In the defining illustration above, the notation used for the partial derivatives of V and 
S was an adaptation of the notation for what will, in future, be referred to as ordinary 
derivatives. 

It was, however, rather cumbersome; and the more standard notation which uses the symbol 
d rather than d is indicated by restating the earlier results as 


dV 2 dS 

= 7 r-r — 

dh ’ dh 


2nr 


and 


dV 

dr 


2nrh, 


dS 

dr 


Anr + 2irh. 
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In this notation, it is understood that each independent variable (except the one with respect 
to which we are differentiating) is held constant. 

EXAMPLES 


1. Determine the partial derivatives of the following functions with respect to each of the 
independent variables: 

(a) 

z = {x 2 + 3 yf] 


Solution 


and 


(b) 


— 1 = 5(x 2 + 3y) A .2x = lOada; 2 + 3 y) 4 
ox 

r) 7 

— = 5(a; 2 + 3yf.3 = 15(a; 2 + 3 y) 4 . 


w = ze 3x ~ 7y : 


Solution 


dw 

dx 


3 ze 3x ~ 7y , 


and 


Solution 


and 


dw 

dy 


-7 ze 3x ~ 7y , 


dw 

dz 


e 3x ~7 y 


z = x sin (2a: 2 + by). 


dz 

dx 


sin(2a; 2 + by) + 4a; 2 cos(2a; 2 + by) 


-7- = 5a;cos(2a; 2 + by), 
dy 
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2. If 

show that 


Solution 


and 


Hence, 


3. Given the formula 


* = f(x 2 + y 2 ). 


dz dz 

x ~ r~yjr = °- 

ay dx 


dz 

dx 


2 xf\x 2 + y 2 ) 


- = 2 yf(x 2 + y 2 ). 
dy 


dz dz 


X dy V dx 


0 . 


cos(a: + 2 z) + 3 y 2 + 2 xyz = 0 


as an implicit relationship between two independent variables x and y and a dependent 
variable z, determine expressions for and in terms of x, y and z. 

Solution 

Differentiating the formula partially with respect to x and y in turn, we obtain 


and 


sin (a: + 2 z). (l + 2^^ + 2 y + y \ — 0 


dz ( dz \ 

- sin(x + 2z).2— + 6y + 2x ( y— + zj =0, 


respectively. 

Thus, 

dz sin(x + 2 z) — 2 y 2 
dx 2 yx — 2 sin(x + 2 z) 
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and 


dz 2 xz + 6 y xz + 3 y 

dy 2 sin (a: + 2 z) — 2 xy sin(x + 2 z) — xy 


14.1.3 EXERCISES 


1. Determine and in the following cases: 


(b) 

(c) 

(d) 


2. If 


show that 


z = 2a; 2 — 4 xy + y 3 ; 
z = cos (5a; — 3 y); 


z = e x2+2y 2 ; 


z = x sin(y — a;). 


* = (x + y) In , 


dz dz 

+ y a = z. 
dx dy 

3. Determine and ^ in the following cases: 


w = a; s + 3xyx + z 2 \ 


(b) 

(c) 


w = ze 2x ~ 3y : 


w = sin(a; 2 — yz). 
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14.1.4 ANSWERS TO EXERCISES 


1. (a) 


(b) 


(c) 


(d) 


2 . 


3. (a) 


(b) 


(c) 


— = Ax — Ay and — = —Ax + 3 y ; 
ox ay 


dz 

dx 


—5 sin(5x 


3 y) and 


dz 

dy 


3 sin(5x — 3 y); 


dz 

dx 


2xe x2+2y2 and ^ = Aye x2+2y2 ; 
dy 


dz 

dx 


sin (y — x) 


x cos (y 


x) 


and 


dz 

dy 


x cos (y — x). 


dz 

dx 


^ + x + y 


and 


dz 

dy 



x + y 
y 


dw 

dx 


5a: 4 + 3 yz, 


dw 

dy 


3 xz, 


dw 

dz 


3 xy + 2 z; 


dw 

dx 


2ze^\ 9 + 
dy 


-zzJ*-\ d + 

dz 


e 2x-3y. 


dw 

dx 


dw 


dw 


= 2a;cos(ar — yz), — — = — zcos(ar — yz), — — = —y cos(ar — yz). 


dy 


dz 
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UNIT 14.2 - PARTIAL DIFFERENTIATION 2 


PARTIAL DERIVATIVES OF THE SECOND AND HIGHER ORDERS 
14.2.1 STANDARD NOTATIONS AND THEIR MEANINGS 


In Unit 14.1, the partial derivatives encountered are known as partial derivatives of the first 
order; that is, the dependent variable was differentiated only once with respect to each 
independent variable. 

But a partial derivative will, in general contain all of the independent variables, suggesting 
that we may need to differentiate again with respect to any of those variables. 

For example, in the case where a variable, z, is a function of two independent variables, x 
and y , the possible partial derivatives of the second order are 


d 2 z 
dx 2 ’ 


which means 


d fdz\ 

dx y dx J ’ 


n 


(iii) 


(iv) 


d 2 z 

tyf 2 ' 


which means 


d (dz\ 

dy ych/ J ’ 


d 2 z 
dxdy ’ 


which means 


d (dz\ 
dx \dy ) ’ 


d 2 z 
dydx ’ 


which means 


d f dz\ 
dy \dx ) 


The last two can be shown to give the same result for all elementary functions likely to be 
encountered in science and engineering. 

Note: 


Occasionally, it may be necessary to use partial derivatives of order higher than two, as 
illustrated, for example, by 


d 3 z 

dxdy 2 ’ 


which means 


d 

' d 

(dzY 

dx 

dy 

\9yJ_ 


1 



and 


d A z 

dx 2 d y 2 ’ 


which means 



8 fdz ' 
dy \dy j 


EXAMPLES 

Determine all the first and second order partial derivatives of the following functions: 
1 . 


Solution 


z = 7x 3 

— 5 x 2 y + 6 y 3 . 

|| = 21a; 2 — 10 xy; 

= —5a; 2 + 18a/ 2 

£f = 42a; - 10 y; 

W = 36 « 

f* = 10a;; 

oyox 

05 

1 

o 


2 . 


z — y sin x + x cos y. 


Solution 


|| = y cos x + cos y; 


C17 

tt- — smx — xsmy; 


a 2 z 

dx 2 


—y sin a;; 


d 2 z 

= -a; cos y; 


d 2 Z 

dydx 


cos x — sin y, 


a 2 z 

dxdy 


cos x — sin y. 
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3 . 


z = e xy {2 x-y). 


Solution 


I = e xy Wx - y) + 2] 
= e xy [2 xy — y 2 + 2] ; 

0 = e*%(2a;j/ - y 2 + 2) + 2y] 
= e x ^[2a;?/ 2 — y 3 + 1//]; 

= e xy [x(2 xy - y 2 + 2) + 2x - 2 y] 
= e xy [2x 2 y — xy 2 + 4a; — 2 a/] ; 

14.2.2 EXERCISES 


% = e xy [x(2x - y) - 1] 
= e xy [2x 2 — xy — 1]; 

c>z _ e xy[ x (2x 2 — xy — 1) — x] 
= e xy [2x 3 — x 2 y — 2a;]; 

Sk = eXy [y( 2x2 - xy - 1) + Ax -y] 
= e xy [2 x 2 y — xy 2 + 4a; — 2 y\. 


1. Determine all the first and second order partial derivatives of the following functions: 

(a) 

z = 5 x 2 y 3 — 7a; 3 ?/ 5 ; 

(b) 

z = x 4 sin 3 y. 

2. Determine all the first and second order partial derivatives of the function 

w = z 2 e xy + x cos (y 2 z). 


3. If 


show that 


z = (x + y) In 



5 


dx 2 



d ' 2 _z_ 
dy 2 


3 



4. If 

show that 


z = f(x + ay) + F(x - ay), 

d 2 z _ 1_ 5 0 
dx 2 a 2 d y 2 


14.2.3 ANSWERS TO EXERCISES 

1. (a) The required partial derivatives are as follows: 

= 10 xy 3 — 21 x 2 y 5 ; = 15 x 2 y 2 — 35 x 3 y 4 ; 


0 = 10 y 3 - 42 xy 5 ; ff = 30 x 2 y - 140a; V; 


= 30 xy 2 — 105 x 2 y A ] -^4- = 30 xy 2 — 105 x 2 y A . 

oyox & & j axoy & & 


(b) The required partial derivatives are as follows: 


|| = 4a; 3 sin 3 y; = 3a; 4 cos 3 y; 

y-§ = 12a; 2 sin 3 y, 0 = — 9a; 4 sin3?/; 


d 2 z 

dydx 


12a; 3 cos 3 y, 


d 2 Z 

dxdy 


12a; 3 cos 3 y. 


2. The required partial derivatives are as follows: 


dw 

dx 


= yz 2 e xy + cos (y 2 z); 


dw 

dy 


xe 


xy 


2 xyz sin(y 2 ^); 


dw 

dz 


2 ze xy — xy 2 sin (y 2 z); 


d 2 w 

dx 2 


y 2 z 2 e xy ; 


d 2 w 

dy 2 


z 2 x 2 e xy — 2xz sin(y 2 z)+4xy 2 z 2 cos (y 2 z)\ 


d 2 w 

dz 2 


2 e xy —xy 4 cos (y 2 z) 
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d 2 w 

dxdy 


d 2 w 
dydx 


= z 2 e xy + z 2 xye xy 


2yzsin(y 2 z); 


d 2 w 
dydz 


d 2 w 

dzdy 


2 zxe xy — 2xysai[y 2 z) — 2xy 3 z cos(y 2 z); 


3 . 


4 . 


d 2 w 

dzdx 


d 2 w 

dxdz 


2 zye xy -y 2 sin (y 2 z). 


d 2 z 
dx 2 


1 

x 


4 and 
x 2 


d *z 
dy 2 


1 x 
- + 
y y 


d 2 z 
dx 2 


f"(x + ay) + F"{x 


ay) and 


6 ?z 
dy 2 


a 2 f"{x + ay) + a 2 F"{x — ay). 
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UNIT 14.3 - PARTIAL DIFFERENTIATION 3 


SMALL INCREMENTS AND SMALL ERRORS 

14.3.1 FUNCTIONS OF ONE INDEPENDENT VARIABLE - A RECAP 

For functions of one independent variable, a discussion of small increments and small errors 
has already taken place in Unit 11.6. 

It was established that, if a dependent variable, y, is related to an independent variable, x, 
by means of the formula 


V = f(x), 


then 

(a) The increment, 8y, in y, due to an increment of 8x, in x is given (to the first order of 
approximation) by 


8y ~ 


dy 

da; 


8x ; 


and, in much the same way, 

(b) The error, 8y, in y, due to an error of 8x in x, is given (to the first order of approximation) 
by 


8y — 


dy 

da; 


8x. 


14.3.2 FUNCTIONS OF MORE THAN ONE INDEPENDENT VARIABLE 

Let us consider, first, a function, z, of two independent variables, x and y, given by the 
formula 


z = f{x,y). 

If x is subject to a small increment (or a small error) of 8x, while y remains constant, then 
the corrresponding increment (or error) of 8z in z will be given approximately by 
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Sz 


dz 

dx 


5x. 


Similarly, if y is subject to a small increment (or a small error) of Sy, while x remains 
constant, then the corresponding increment (or error) of 5z in z will be given approximately 
by 


dz 

8z ~ — 8y. 
dy 


It seems reasonable to assume, therefore, that, when x is subject to a small increment (or a 
small error) of 8x and y is subject to a small increment (or a small error) of 8y, then the 
corresponding increment (or error) of 8z in z will be given approximately by 


r dz . dz . 

dz ~ ~ — ox + — dy. 
dx dy 


It may be shown that, to the first order of approximation, this is indeed true. 

Notes: 

(i) To prove more rigorously that the above result is true, use would have to be made of the 
result known as “Taylor’s Theorem” for a function of two independent variables. 

In the present case, where z — f(x,y), it would give 


f(x + 8x,y + 8y ) = f(x,y)+(^8x + + 


dx 


dy 


' d 2 z 
dx 2 


d 2 


d 2 z 


(te) + 2——8x8y+—(8y) +. 


dxdy 


dy 2 


which shows that 


8z = f(x + 8x,y + 8y) - f(x, y) ~ 


dz 

+ 

dx 


dz 

dy 


8y 


to the first order of approximation. 

(ii) The formula for a function of two independent variables may be extended to functions 
of a greater number of independent variables by simply adding further appropriate terms to 
the right hand side. 
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For example, if 


w = F(x,y,z), 


then 


. dw . dw dw 

ow ~ — — ox + — — oy + — — oz . 
ox oy oz 


EXAMPLES 

1. A rectangle has sides of length xcms. and ycms. 

Determine, approximately, in terms of x and y, the increment in the area, A, of the 
rectangle when x and y are subject to increments of 5x and Sy, respectively. 

Solution 

The area, A, is given by 


so that 


A = xy, 


d A d A 

oA ~ — — ox + ~^—oy 
ox oy 


ySx + x8y. 


Note: 

The exact value of 5 A may be seen in the following diagram: 
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5y 


y 

cc dx 

The difference between the approximate value and the exact value is represented by the 
area of the small rectangle having sides fecms. and 5y cms. 

2. In measuring a rectangular block of wood, the dimensions were found to be lOcms., 
12cms and 20cms. with a possible error of ±0.05cms. in each. 

Calculate, approximately, the greatest possible error in the surface area, S, of the block 
and the percentage error so caused. 

Solution 

First, we may denote the lengths of the edges of the block by x, y and z. 



x 


The surface area, S, is given by 

S = 2 (xy + yz + zx ), 

which has the value 1120cms 2 when x = lOcms., y = 12cms. and z = 20cms. 
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Also, 


rrr dS f dS f f 
AS ~ —Ac + — Sy + -£-oz, 
Ox ay oz 


which gives 


5S ~ 2(y ± z)5x ± 2(x + z)Sy + 2(y ± x)Az; 

and, on substituting x — 10, y = 12, z — 20, 5x = ±0.05, Sy = ±0.05 and Sz = ±0.05, 
we obtain 


SS ~ ±2(12 ± 20)(0.05) ± 2(10 ± 20)(0.05) ± 2(12 ± 10)(0.05). 

The greatest error will occur when all the terms of the above expression have the same 
sign. Hence, the greatest error is given by 

AS'max ^ ±8.4cms. 2 ; 

and, since the originally calculated value was 1120, this represents a percentage error 
of approximately 


± 


8.4 

1120 


x 100 


±0.75 


calculate, approximately, the percentage error in w when x is too small by 3%, y is too 
large by 1% and z is too large by 2%. 

Solution 

We have 


That is, 


. dw dw dw 
ow ~ — — ox ± - 7 T-oy ± — — oz . 
ox oy oz 


dw ~ 


3arz 

y 4 


4 x 3 z 


x ° 


5x —Sy + -rSz, 


V 0 


IF 


where 
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Sx = — , 8y = and 5z 

100’ y 100 


Thus, 


8w — 


x 3 z 


y 


9 4 2 

100 _ 100 + 100 


The percentage error in w is given approximately by 


5w 

— x 100 = -11. 

w 


That is, w is too small by approximately 11%. 


2z 

loo' 


11 w 
100 ' 


14.3.3 THE LOGARITHMIC METHOD 

In this section we consider again examples where it is required to calculate either a percentage 
increment or a percentage error. 

We may conveniently use logarithms if the right hand side of the formula for the dependent 
variable involves a product, a quotient, or a combination of these two in which the inde- 
pendent variables are separated. This would be so, for instance, in the final example of the 
previous section. 

The method is to take the natural logarithms of both sides of the equation before considering 
any partial derivatives; and we illustrate this, firstly, for a function of two independent 
variables. 

Suppose that 


^ = f{x,y) 

where f(x,y ) is the type of function described above. 

Then, 

lnz = In f(x,y); 

and, if we temporarily replace \wz by w, we have a new formula 
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w = In f(x,y). 


The increment (or the error) in w, when x and y are subject to increments (or errors) of 5x 
and Sy respectively, is given by 


. dw dw 
ow ~ — — ox + -z—oy. 
ox ay 


That is, 


5w ~ 


df 


8x + 


df 


5y = 


f(x, y) dx f(x,y)dy f(x,y) 


df df 

-rf-dx + -^-8y 
ox ay 


In other words, 


Sw — 


dz dz 

-w-ox+ —dy 

ox oy 


We conclude that 


V UJ 7 

z 

which means that the fractional increment (or error) in z approximates to the actual incre- 
ment (or error) in In z. Multiplication by 100 will, of course, convert the fractional increment 
(or error) into a percentage. 

Note: 

The logarthmic method will apply equally well to a function of more than two independent 
variables where it takes the form of a product, a quotient, or a combination of these two. 


7 



EXAMPLES 


calculate, approximately, the percentage error in w when x is too small by 3%, y is too 
large by 1% and z is too large by 2%. 

Solution 

Taking the natural logarithm of both sides of the given formula, 


In w — 3 In x + In z — 4 In y, 


giving 

8w Sx Sz 1 5y 

— ~ 3 1 4 — , 

w x z y 

where 

5x 3 Sy 1 8z 2 

7 - “lOO’ ~y ~ Too an "7 “ loo' 


Hence, 


5w 

w 


x 100 


-9 + 2 - 4 = -13. 


Thus, w is too small by approximately 11%, as before. 
2. In the formula, 


w = 



x is subjected to an increase of 2%. Calculate, approximately, the percentage change 
needed in y to ensure that w remains unchanged. 

Solution 

Taking the natural logarithm of both sides of the formula, 



Hence, 


Sw 1 


w ~ 2 



6y 

y 


where — = 0.02, and we require that 5w = 0. 
Thus, 


0 = 


1 

2 


0.06 


hi 

y 




giving 


hi 

y 


0.06, 


which means that y must be approximately 6% too large. 


14.3.4 EXERCISES 


1. A triangle is such that two of its sides (of length 6cms. and 8cms.) are at right-angles 
to each other. 

Calculate, approximately, the change in the length of the hypotenuse of the triangle 
when the shorter side is lengthened by 0.25cms. and the longer side is shortened by 
0.125cms. 

2. Two sides of a triangle are measured as x = 150cms. and y = 200cms. while the angle 
included between them is measured as 6 = 60°. Calculate the area of the triangle. 

If there are possible errors of ±0.2cms. in the measurement of the sides and ±1° in the 
angle, determine, approximately, the maximum possible error in the calculated area of 
the triangle. 

State your answers correct to the nearest whole number. 

(Hint use the formula, Area = \xy sin#). 

3. Given that the volume of a segment of a sphere is | x(x 2 + 3 y 2 ) where x is the height 
and y is the radius of the base, obtain, in terms of x and y, the percentage error in the 
volume when x is too large by 1% and y is too small by 0.5%. 

4. If 

z = kx 0m y 008 , 

where k is a constant, calculate, approximately, the percentage change in z when x is 
increased by 2% and y is decreased by 1%. 


9 



5. If 


5 xy A 


calculate, approximately, the maximum percentage error in w if x, y and z are subject 
to errors of ±3%, ±2.5% and ±4%, respectively. 

6. If 

_ 1 

w = 2 xyz 2 , 


where x and z are subject to errors of 0.2%, calculate, approximately, the percentage 
error in y which results in w being without error. 

14.3.5 ANSWERS TO EXERCISES 

1. 0.05cms. 

2. 12990cms. 2 and 161cms. 2 

q 3x 2 
x 2 +3y 2 ' 

4. z decreases by 0.06%. 

5. 25%. 

6 . - 0 . 1 %. 
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UNIT 14.4 - PARTIAL DIFFERENTIATION 4 


EXACT DIFFERENTIALS 
14.4.1 TOTAL DIFFERENTIALS 

In Unit 14.3, use was made of expressions of the form, 


df 

dx 


Sx + 


d _l 

dy 


5y + . 


*? 


as an approximation for the increment (or error), df, in the function, f{x,y, ...), when x, y 
etc. are subject to increments (or errors) of Sx, 5y etc., respectively. 

The expression may be called the “total differential” of f(x,y,...) and may be denoted 
by df, giving 


df ~ 5f. 


OBSERVATIONS 

Consider the formula, 


(a) In 

words, 


. .. df df 

df = -^Sx+^-Syd- 
dx dy 


the special case when f(x,y,...) = x, we may conclude that df 


dx or, in other 


da; = Sx. 

(b) In the special case when f(x,y,...) = y, we may conclude that df = dy or, in other 
words, 


dy = dy. 

(c) Observations (a) and (b) imply that the total differential of each independent variable 
is the same as the small increment (or error) in that variable; but the total differential of the 
dependent variable is only approximately equal to the increment (or error) in that variable. 
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(d) All of the previous observations may be summarised by means of the formula 


df = ^-dx+^-dy + ... 
dx ay 

14.4.2 TESTING FOR EXACT DIFFERENTIALS 

In general, an expression of the form 

P{x, y, ...)dx + Q(x, y, ...)d y + . . . 

will not be the total differential of a function, f(x,y,...), unless the functions, P(x,y, 
Q(x,y , ...) etc. can be identified with ^ etc., respectively. 

If this is possible, then the expression is known as an “exact differential” . 

RESULTS 

(i) The expression 


P(x,y)dx + Q(x,y)dy 
is an exact differential if and only if 

dP _ dQ 
dy dx 

Proof: 

(a) If the expression, 


P(x,y) dx + Q(x,y)dy, 


is an exact differential, d /, then 


d _l 

dx 


P(x,y ) and 


df_ 

dy 


Q(x,y). 
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Hence, it must be true that 


dP _ d Q ( d 2 f \ 
dy dx y dxdy J 


(b) Conversely, suppose that 


dP_ = dQ 
dy dx 


We can certainly say that 


P(x,y) 


du 

dx 


for some function u(x,y), since P(x,y) could be integrated partially with respect to x. 
But then, 


dQ _ dP _ d 2 u 
dx dy dydx ’ 


and, on integrating partially with respect to x, we obtain 


Q{x,y) 


du 

dy 


+ Mv ), 


where A(y) is an arbitrary function of y. 
Thus, 


du ( du \ 

P(x, y) dx + Q(x, y)dy = —dx + ( ^ + A (v) j ^ 


and the right-hand side is the exact differential of the function, 


u(x,y) + J A(y) dy. 
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(ii) By similar reasoning, it may be shown that the expression 


P(x , y, z)dx + Q(x, y, z)dy + R(x, y, z)dz 


is an exact differential, provided that 

dP _d Q dQ _ dR dR _ dP 

dy dx ’ dz dy ’ dx dz 


ILLUSTRATIONS 

1 . 


2 . 

3. 


xdx + ydy = d 


x ( x 2 + y 2 ) 


ydx + xdy = d [xy\. 


ydx — xdy 


is not an exact differential since 


dy_ 

dy 


1 and 


d(—x) 

dx 


- 1 . 


4. 


2 In ydx + (x + z)dy + z 2 dz 


is not an exact differential since 

<9(2 In y) 2 d(x + z) 

— = -, and — = 1. 

dy y dx 
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14.4.3 INTEGRATION OF EXACT DIFFERENTIALS 


In section 14.4.2, the second half of the proof of the condition for the expression, 

P(x,y)dx + Q(x,y)dy, 

to be an exact differential suggests, also, a method of determining which function, f(x,y), 
it is the total differential of. The method may be illustrated by the following examples: 

EXAMPLES 

1. Verify that the expression, 


(x + y cosx)da; + (1 + sin x)dy, 

is an exact differential, and obtain the function of which it is the total differential. 

Solution 

Firstly, 

d d . . . 

— [x + y cos op = — (1 + sm a;) = cos a;; 
ay ox 

and, hence, the expression is an exact differential. 

Secondly, suppose that the expression is the total differential of the function, f(x,y). 
Then, 


df 

— = x + y cos x 
ox 


( 1 ) 


and 


df 1 

— = 1 + Sill X. 

ay 


-( 2 ) 


Integrating (1) partially with respect to x gives 


x 


f(x,y) = y +ysinx + A(y), 


where A(y) is an arbitrary function of y only. 
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Substituting this result into (2) gives 


That is, 


and, hence, 


dA i . 
sin x + — — = 1 + smj. 
d y 



A(y) = y + constant. 


We conclude that 


x 


f(x,y ) = — + y sin x + y + constant. 


2. Verify that the expression, 

(yz + 2)da; + (xz + 6y)dy + ( xy + 3z 2 )dz, 

is an exact differential and obtain the function of which it is the total differential. 

Solution 

Firstly, 

d d 

dy( yZ + 2) = Q^(^ + 6y) = z, 


d . d 2 

[xz + 6 y) = ^(xy + 3^ ) = x, 


and 


d d 

— (xy + 3^ 2 ) = — (yz + 2 ) = y, 


so that the given expression is an exact differential. 
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Suppose it is the total differential of the function, F(x,y,z). 
Then, 


d F 

dx 
d F 

dy 
d F 

dz 


yz + 2, 
xz + 6 y, 
xy + 3z 2 . 


Integrating (1) partially with respect to x gives 


-( 1 ) 

( 2 ) 

( 3 ) 


F(x, y, z) = xyz + 2x + A(y, z), 

where A(y, z) is an arbitrary function of y and z only. 
Substituting this result into both (2) and (3) gives 


That is, 


xz + 


dA 

dy 



xz + 6 y, 
xy + 3z 2 . 


da 

dy 

= 6 y, 

( 4 ) 

dA 

dz 

= 3z 2 . 

(5) 


Integrating (4) partially with respect to y gives 


A (y,z) = 3 y 2 + B(z), 


where B(z) is an arbitrary function of z only. 
Substituting this result into (5) gives 
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which implies that 


B(z) = z 3, + constant. 


We conclude that 


F(x, y, z ) = xyz + 2x + 3 y 2 + £ 3 + constant 


14.4.4 EXERCISES 


1. Verify which of the following are exact differentials and integrate those which are: 


5x + 12 y — 9)dx + (2x + 5y — 4)d y; 


12x + 5 y — 9)dx + (5x + 2 y — 4)d y; 


(3x 2 + 2y + l)dx + (2x + 6 y 2 + 2)d y; 


(y — e x )dx + xdy; 


-dx - ( + 2 .x- N ) dy; 


cos(x + y)dx + cos(y — x)dy; 


2. Verify that the expression, 


(1 — cos 2x)dy + 2 y sin 2xdx. 


3x 2 dx + 2yzdy + y 2 dz, 


is an exact differential and obtain the function of which it is the total differential. 



3. Verify that the expression, 


e xy [y sin zdx + x sin zdy + cos zdz ] , 

is an exact differential and obtain the function of which it is the total differential. 


14.4.5 ANSWERS TO EXERCISES 

1. (a) Not exact; 

(b) 


6x 2 + 5 xy — 9x + y 2 — Ay + constant; 


(d) 

(e) Not exact; 

(f) Not exact; 

(g) 


a: 3 + 2 xy + x + 2 y 3 + 2y + constant; 


xy — e x + constant; 


2 . 

3. 


y( 1 — cos 2x) + constant. 


x 3 + y 2 z ; 


e xy sin^. 
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UNIT 14.5 - PARTIAL DIFFERENTIATION 5 


PARTIAL DERIVATIVES OF COMPOSITE FUNCTIONS 
14.5.1 SINGLE INDEPENDENT VARIABLES 

In this Unit, we shall be concerned with functions, f(x,y...), of two or more variables in 
which those variables are not independent, but are themselves dependent on some other 
variable, t. 

The problem is to calculate the rate of increase (positive or negative) of such functions with 
respect to t. 

Let us suppose that the variable, t, is subject to a small increment of St, so that the variables 
x,y... are subject to small increments of 5x, Sy, . . . , respectively. Then the corresponding 
increment, Sf, in f(x,y...) is given by 


Sf - -^Sx + -w-Sy + . . ., 

ox ay 

where we note that no label other than / is being used, here, for the function of several 
variables. That is, it is not essential to use a specific formula, such as w = f{x,y..). 

Dividing throughout by St gives 

Sf df Sx df Sy 

St dx St dy St + 

Allowing St to tend to zero, we obtain the standard result for the “total derivative” of 
f(x,y..) with respect to t, namely 

df df da; df dy 

d t dx ' dt dy ' df ’ 

This rule may be referred to as the “chain rule”, but more advanced versions of it will 
appear later. 
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EXAMPLES 


1. A point, P, is moving along the curve of intersection of the surface whose cartesian 
equation is 

X ^ y ^ 

— — — = z (a Paraboloid) 

16 9 

and the surface whose cartesian equation is 


x 2 + y 2 = 5 (a Cylinder) . 


If x is increasing at 0.2 cms/sec, how fast is z changing when x = 2 ? 

Solution 

We may use the formula 


dz 

dt 


dz da; 
dx d t 


+ 


dz d y 
dy dt ' 


where 


da; 

dt 


0.2 and 


dy 

dt 


dy da; dy 

-A — _ 0.2-—. 
da; dt da; 


But, from the equation of the paraboloid, 


dz x dz 2 y 

d^ = 8 and dy = __ 9“ ; 

and, from the equation of the cylinder, 

dy x 

da; y 


Substituting x = 2 gives y — ±1 on the curve of intersection, so that 


dz 

dt 


(!) (0 - 2 > + (-§) ' ±1)(0 - 2 > ( if) 



5 

36 


cms/sec. 
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2. Determine the total derivative of u with respect to t in the case when 


u = xy + yz + zx, x = e*, y = e * and z = x + y. 


Solution 

We may use the formula 


where 


and 


d u 
d t 


du da; 
dx d t 


+ 


du d y 
dy dt 


+ 


du d z 
d?df’ 


du du 

^~ = y + z, vp 
dx dy 


du 

z + x , w- 
dz 


x + y 


dx , dy 

— = e 1 = x, 
dt dt 



dz 

dt 


= e* — e 1 = x — y. 


Hence, 


That is, 


— = (y + z)x - (z + x)y + (x + y)(x- y) 

= —zy + zx + x 2 — y 2 
= z(x-y) + (x — y)(x + y). 


du 

dt 


(x - y)(x + y + z). 
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14.5.2 SEVERAL INDEPENDENT VARIABLES 


We may now extend the work of the previous section to functions, f(x,y..), of two or more 
variables in which x,y.. are each dependent on two or more variables, s,t.. 

Since the function, f(x,y..), is dependent on s,t.., we may wish to determine its partial 
derivatives with respect to any one of these (independent) variables. 

The result previously established for a single independent variable may easily be adapted 
as follows: 


df df dx df dy 

ds dx ds ^ dy ds 

df df dx df dy 

dt dx dt ^ dy dt 

Again, this is referred to as the “chain rule” . 

EXAMPLES 

1. Determine the first-order partial derivatives of z with respect to r and 9 in the case 
when 


z = x 2 + y 2 , where x = r cos 6 and y = r sin 29. 


Solution 

We may use the formulae 


and 


These give 


dz dz dx dz dy 

dr dx dr dy dr 


dz 

dO 


dz dx 
dx d6 


+ 


dz dy 
dy dO 



2x cos 0 + 2y sin 20 
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2 r (cos 2 # + sin 2 2 6 


and 


Qz 

(ii) — = 2 x(—r sin 6) + 2y(2r cos 2d) 
Ou 


= 2 r 2 (2 cos 26 sin 26 — cos 6 sin 6) . 

2. Determine the first-order partial derivatives of w with respect to u, 6 and 0 in the 
when 

w = x 2 + 2y 2 + 2z 2 , 


where 


x = if sin 0 cos 0, y = u sin 0 sin 6 and z = ucoscj). 


Solution 

We may use the formulae 

dw dw dx dw dy dw dz 

du dx' du^ dy' du^ dz' du 


and 


These give 


dw dw dx dw dy dw dz 

d6 dx d6 ^ dy d6 dz d6 


dw dw dx dw dy dw dz 

d(j) dx ' d(f> dy'dcj)^ dz' d<t>' 



2x sin 0 cos 6 + Ay sin 0 sin 6 + Az cos 0 


= 2wsin 2 0cos 2 # + 4nsin 2 0sin 2 d + 4ncos 2 0; 


case 


5 



11 


dw 

~36 


—2 xu sin 0 sin 9 + Ayu sin 0 cos 9 


= — 2w 2 sin 2 0 sin 9 cos 9 + 4w 2 sin 2 0 sin 9 cos 9 


= 2m 2 sin 2 0 sin 9 cos 9\ 



2 xu cos 0 cos 9 + Ayu cos 0 sin 9 — Azu sin 0 


= 2u 2 sin 0 cos 0cos 2 9 + 4w 2 sin 0 cos 0sin 2 (9 — 4m 2 sin 0 cos 0 


= 2u 2 sin 0 cos 0 (cos 2 d + 2sin 2 0 — 2^ . 


14.5.3 EXERCISES 

1. Determine the total derivative of z with respect to t in the cases when 
(a) 


(b) 


z = x 2 + 3xy + 5y 2 , where x — sin t and y = cost; 


z = ln(x 2 + y 2 ), where x = e 1 and y = e 4 ; 


z = x 2 y 2 where x = 2 1 3 and y = 3t 2 . 

2. If z — f(x,y), show that, when y is a function of x, 

dz df df d y 

da; dx dy da; 

Hence, determine 4^ in the case when z = xy + x 2 y and y = In a;. 
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3. The base radius, r, of a cone is decreasing at a rate of O.lcms/sec while the perpendicular 
height, h. is increasing at a rate of 0.2cms/sec. Determine the rate at which the volume, 
V, is changing when r = 2cm and h = 3cm. (Hint: V = ( nr 2 h)/3 ). 

4. A rectangular solid has sides of lengths 3cms, 4cms and 5cms. Determine the rate of 
increase of the length of the diagonal of the solid if the sides are increasing at rates of 
.'cms./sec, perils. /sec and |cms/sec, respectively. 

5. If 

z = (2x + 3 y) 2 where x — r 2 — s 2 and y = 2 rs, 


determine, in terms of r and s the first-order partial derivatives of z with respect to r 
and s. 

6. If 

z — f(x,y) where x — e u cos v and y — e u smv, 

show that 


dz df df 
Ihi = X ~dx + V ~dy 


and 


dz df df 
= ~ V ~d~x +X dy' 


7. If 

w — 5x — 3 y 2 + 7 z 3 where x = 2s + 3t, y = s — t and z = As + t, 


determine, in terms of s and t, the first order partial derivatives of w with respect to s 
and t. 


14.5.4 ANSWERS TO EXERCISES 


1. (a) 


(b) 


— = 3 cos 2t — 4 sin 2t\ 
dt 


dz e 4t — 1 

dt e 4t + 1 


d^ 

dt 


360t 9 . 
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2 . 


U.A/ _ 

— = y + 2 xy + 1 + x. 
ax 

3. The volume is decreasing at a rate of approximately 0.42 cubic centimetres per second. 

4. The diagonal is increasing at a rate of approximately 0.42 centimetres per second. 


dz dz 

— = 8(r 2 — s 2 + 3rs)(2r + 3s) and — 1 = 8(r 2 — s 2 + 3rs)(3r — 2s). 
or os 

6. Results follow immediately from the formulae 

dz df dx df dy dz df dx df dy 

du dx du dy du dv dx dv dy dv 


— = 1344s 2 + 672 st + 84f 2 - 6s + 6t + 10 


— = 336s 2 + 168sf + 21t 2 + 6s - 6t + 15. 
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UNIT 14.6 - PARTIAL DIFFERENTIATION 6 


IMPLICIT FUNCTIONS 

14.6.1 FUNCTIONS OF TWO VARIABLES 

The chain rule, encountered earlier, has a convenient application to implicit relationships of 
the form, 


f(x, y ) = constant, 


between two independent variables, x and y. 

It provides a means of determining the total derivative of y with respect to x. 

Explanation 

Taking x as the single independent variable, we may interpret f(x,y ) as a function of x and 
y in which both x and y are functions of x. 

Differentiating both sides of the relationship, f(x,y) = constant, with respect to x gives 

df_ da; + df_ dy = Q 
dx da; dy da; 


In other words, 

9f_ + df_ dy_ = o 
dx dy da; 


Hence, 


dy 

da; 


91 

dx 

Of' 

dy 
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EXAMPLES 


1. If 


f(x, y) = x 3 + 4a; 2 y — 3 xy + y 2 = 0, 
determine an expression for 

Solution 


= 3x 2 + 8a;y — 3y and = Ax 2 — 3x + 2 y. 


dx 


Hence, 


dy 


d y 3x 2 + 8 xy — 3 y 

dx Ax 2 — 3x + 2y 


2. If 


f(x, y) = x sin(2a; — 3 y) + y cos(2a; — 3 y), 
determine and expression for 

Solution 


df_ 

dx 


= sin(2x — 3 y) + 2a;cos(2a; — 3 y) — 2ysm(2x — 3 y) 


and 


Hence, 


Of 

— = — 3a;cos(2a; — 3 y) + cos(2a; — 3 y) + 3y sin(2x — 3 y). 


dy sin(2x — 3 y) + 2x cos(2a: — 3 y) — 2 y sin(2x — 3y) 

da; 3a; cos(2a; — 3y) — cos(2a; — 3 y) — 3y sin (2a; — 3y) 


14.6.2 FUNCTIONS OF THREE VARIABLES 

For relationships of the form, 


f(x,y,z) = constant, 

let us suppose that x and y are independent of each other. 
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Then, regarding f(x,y,z ) as a function of x, y and z, where x, y and z are all functions of 
x and y, the chain rule gives 


df dx df dy df dz 

dx dx dy dx dz dx 


But, 


dx 

dx 


1 and 


dy i 
dx 


0 . 


Hence, 


df df dz 

— -| — — = n 

dx dz dx 


giving 


and, similarly, 


dz 

dx 


dl 

dx . 

df’ 

dz 


dz 

dy 


df 

dy 

df' 

dz 


EXAMPLES 


1. If 


f(x, y, z) = z 2 xy + zy 2 x + x 2 + y 1 


determine expressions for d f and . 

± ax ay 

Solution 


5, 


dx 


z 2 y + zy 2 + 2x, 
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and 


Hence, 


and 


df o 

— — = z 2 x + 2 zyx + 2 y 
dy 


dz 


2zxy + y 2 x. 


dz z 2 y + zy 2 + 2x 

dx 2 zxy + y 2 x 


dz z 2 x + 2 zyx + 2 y 

dy 2 zxy + y 2 x 


2. If 

f(x,y,z) = xe y2+2z 
determine expressions for ^ and '] z . 

L ax dy 

Solution 


_ p y 2 +2z 
dx 


and 


Hence, 


and 


d J_ 

dy 


2 yxe y2+2z 


5 


df 

dz 


2xe y2+2z . 


dz __ e y2+2z _ _ 1 
dx 2xe y2+2z 2x 


dz 2 yxe y2+2z 
dy ~ 2xe y2+2z 
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14.6.3 EXERCISES 


1. Use partial differentiation to determine expressions for ^ in the following 

(a) 

x 3 + y 3 — 2 x 2 y = 0; 

(b) 

e x cos y = e y sin a;; 


2. If 


sin x — 5 sin x cos y + tan y — 0. 


x 2 y + y 2 z + z 2 x = 10, 


where x and y are independent, determine expressions for 

dz dz 

— and — . 
ox ay 


3. If 


xyz — 2 sin(x 2 + y + z) + cos (xy + z 2 ) = 0, 
where x and y are independent, determine expressions for 

dz dz 

— and — . 
dx dy 


4. If 


r 2 sin 6 = (r cos 6 — 1 )z, 


where r and 6 are independent, determine expressions for 

dz dz 

Fr md W 


cases: 
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14.6.4 ANSWERS TO EXERCISES 


1. (a) 


(b) 


2 . 


and 


3. 


and 


4. 


and 


d y 4 xy — 3x 2 

da; 3 y 2 — 2x 2 ’ 


d y e x cos y — e y cos x 
da; x x sin y + e y sin x ’ 


d y 5 cos x cos y — 2 sin x cos x 

da; 5 sin xsiny + sec 2 y 


dz 2 xy + z 2 

dx y 2 + 2 zx 


dz x 2 + 2 yz 

dy y 2 + 2 zx 


dz yz — 4a; cos(a; 2 + y + z) — y sin(a;y + z 2 ) 

dx xy — 2 cos(a; 2 + y + z) — 2z sin(xy + z 2 ) 


dz xz — 2 cos(x 2 + y + z) — x sin (xy + z 2 ) 

dy xy — 2 cos(a; 2 + y + 2 ) — 2z sin(a;y + ^ 2 ) 


dz 2 r sin 6 — z cos 6 
dr r cos 6—1 


dz r 2 cos 9 + rz sin 9 
d9 r cos 9 — 1 
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UNIT 14.7 - PARTIAL DIFFERENTIATON 7 


CHANGE OF INDEPENDENT VARIABLE 
14.7.1 ILLUSTRATIONS OF THE METHOD 

In the theory of “partial differential equations” (that is, equations which involve partial 
derivatives) it is sometimes required to express a given equation in terms of a new set of 
independent variables. This would be necessary, for example, in changing a discussion from 
one geometrical reference system to another. The method is an application of the chain rule 
for partial derivatives and we illustrate it with examples. 

EXAMPLES 


1 . 


Express, in plane polar co-ordinates, r and 9, the following partial differential equations: 
(a) 


dV_ dV_ 
dx dy 


i; 


(b) 

d 2 V d 2 V 
dx 2 dy 2 


Solution 

Both differential equations involve a function, V(x,y), where 

x — r cos 9 and y — r sin 9. 


Hence, 


or 


and 


dV dV dx dV dy 
dr dx dr dy dr ’ 


dV 

dr 


dV n dV . n 
— — cos 9 + — — sm 9 
dx dy 


dV dV dx dV dy 
d9 dx ' d9 dy 'd9 ’ 
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or 


8V . . 8V 
— — r sm 9 + — — r cos 9 
ox 8y 


8V 
~89 

Now, we may eliminate, first . and then ^ to obtain 

dV q 8V sin 9 8V 
dx dr r 86 


and 


dV . ..8V cos 9 8V 
~dy = Sm9 lh + ~~86' 


Hence, differential equation, (a), becomes 


/ /I r • /a 8V f 5 cos 9 . \ 8V 

(cos 9 + 5 sm 9) — — + sm 9 — 

dr \ r 89 


= 1 . 


In order to find the second -order derivatives of V with respect to x and y , it is necessary 
to write the formulae for the first-order derivatives in the form 


and 



sin 9 8 \ 
r 89 ) 


cos 9 8 \ 
r 89 ) 


W] 

\y]- 


From these, we obtain 


8 2 V 8 (8V\ { 8 sin 9 8\( 8V sin0 9H\ 

dx 2 8x\dx) \ C ° S dr r 89 ) y C ° S dr r 89 ) 


which gives 

8 2 V 2 q 8 2 V 2 sin 9 cos 9 8V 2 sin# cos 6* 8 2 V sin 2 9 8V sh\ 2 9 8 2 V 

dx 2 C ° S dr 2 r 2 89 r 8r89 r dr r 2 89 2 
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Similarly, 

d 2 V . 2 d 2 V 2 sin 9 cos 9 dV 2 sin 0 cos 0 d 2 V cos 2 9 dV cos 2 0 d 2 V 

dy 2 s Q r 2 r 2 qq r drd6 r <9r r 2 d 9 2 

Adding these together gives the differential equation, (b), in the form 

d 2 V 1 dV 1 d 2 V _ 
dr 2 + r dr + r 2 d9 2 

2. Express the differential equation, 

d 2 V d 2 V d 2 V _ 

dx 2 ch/ 2 dz 2 ’ 

(a) in cylindrical polar co-ordinates, 
and 

(b) in spherical polar co-ordinates. 

Solution 

(a) Using 


x — r cos 9 : y — r sin 9 and z = z, 

we may use the results of the previous example to give 

d 2 V 1 dV 1 d 2 V d 2 V _ 

dr 2 r dr r 2 d9 2 dz 2 

(b) Using 


x = u sin 4> cos 9 , y = u sin (j) sin 9 , and z = u cos 0, 

we could write out three formulae for ^ and and then solve for and 

but this is complicated. 

However, the result in part (a) provides a shorter method as follows: 

Cylindrical polar co-ordinates are expressible in terms of spherical polar co-ordinates 
by the formulae 


z = u cos 0, r — u sin 0, 9 = 9. 
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Hence, by using the previous example with z, r, 9 in place of x, y, z respectively and 
u, 0 hr place of r, 9, respectively, we obtain 

d 2 V d 2 V _ d 2 V 1 dV 1 d 2 V 

dz 2 dr 2 du 2 ^ u du u 2 dtp 2 

Therefore, to complete the conversion we need only to consider ^ ; and, by using r, u, 
0 in place of y, r, 9, respectively, the previous formula for hh gives 

dV . , dV cos 0 dV 

dr r du u dp 

The given differential equation thus becomes 

d 2 V 1 dV 1 d 2 V 1 d 2 V 1 

du 2 ^ u du + u 2 <90 2 u 2 sin 2 0 89 2 ^ u sin 0 

That is, 

<9 2 1/ 2 0H I5 2 h cot 0.91/ 1 <9 2 1/ 

<9w 2 u du ^ u 2 (90 2 u 2 <90 + u 2 sin 2 0 89 2 


, <91/ cos 0 <91/ 

S1110— + -7T7- 

(9?i u 8<p 


= 0 . 


14.7.2 EXERCISES 

1. Express the partial differential equation, 

dV dV n 

= °> 

dx dx 

in plane polar co-ordinates, r and 6, where 

x = r cos 9 and y — r sin 9. 

2. Express the differential equation, 

dV dV 8V 
X lte +y lhj ~ ~dz' 

in spherical polar co-ordinates u, 9 and 0, where 


x = w sin 0 cos 9, y — u sin 0 sin 9 and z = u cos0. 
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3. A function 4>(x, t ) satisfies the partial differential equation, 

<9 2 0 1 d 2 0 

dx 2 k 2 dt 2 ’ 


where A; is a constant. 

Express this equation in terms of new independent variables, u and v, where 

1 1 

x — ~{u + v) and t — — (tt — v 

Z Zrv 

4. A function 9(x,y) satisfies the partial differential equation, 


d 2 e dH_ _ 

dx 2 d y 2 

Express this equation in terms of new independent variables, s and t, where 

x — lira and y — lnv. 

Determine, also, an expression for in terms of 9, u and v. 


14.7.3 ANSWERS TO EXERCISES 


1 . 


dV 

dr 


= 0 . 


dV cos 0 + sin 0 dV 

sm 0 — cos 0) — — = 0. 

du u dcp 


and 


<9 2 0 


= 0 . 


u 


dudv 

,d 2 9 2 d 2 9 d6 d9 

+ v x— + u— + v— = 0, 


dx 2 dv 2 du dv 


d 2 e _ d 2 e 

dxdy dudv 
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UNIT 14.8 - PARTIAL DIFFERENTIATION 8 
DEPENDENT AND INDEPENDENT FUNCTIONS 
14.8.1 THE JACOBIAN 

Suppose that 


u = u(x, y ) and v = v(x, y ) 

are two functions of two independent variables, x and y; then, in general, it is not possible 
to express u solely in terms of v, nor v solely in terms of u. 

However, on occasions, it may be possible, as the following illustrations demonstrate: 


ILLUSTRATIONS 

1. If 


then 


which gives 


Hence, 


2. If 


then 


x + y , x — y 
u = and v = , 


x 


y 


. X X 

u = 1 H — and v = 1, 

y y 


(w-l)(u + l) = = 1 . 

y x 


l , l 

u = 1 H r and v = : — 1. 


v + 1 


u — 1 


u = x + y and v = x 2 + 2 xy + y z , 


v = u 2 and u = ±y/v. 

If u and v are not connected by an identical relationship, they are said to be “independent 
functions” . 
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THEOREM 


Two functions, u(x,y) and v(x,y), are independent if and only if 


du 

du 

dx 

dy 

dv 

dv 

dx 

dy 


# 0 . 


Proof: 


We prove an equivalent statement, namely that u(x, y) and v(x, y) are dependent if and only 
if 


du 

du 

dx 

dy 

dv 

dv 

dx 

dy 


(a) Suppose that v is dependent on u by virtue of the relationship 


v = v(u). 


By expressing and in terms of and we shall establish that the determinant, J, 
is identically equal to zero. 

We have 


dv _ du du ^ dv _ du du 
dx d u dx dy du dy 


Thus, 


dv dv 

dx — dy _ 
du du 

dx dy 


du 

du 


or 


du dv dv du 
dx dy dx dy 
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which means that the determinant, J, is identically equal to zero, 
(b) Secondly, let us suppose that 


du 

du 

dx 

dy 

dv 

dv 

dx 

dy 


and attempt to prove that u(x,y ) and v(x,y ) are dependent. 


In theory, we could express v in terms of u and x by eliminating y between u(x, y) and 
v(x,y). 


We shall assume that 


v = A(u, x). 

By expressing anc [ hr j n terms of and |^, we may show that A(u,x ) does not contain 
x. 

We have 

(v) s (r) + (w) 

\ du J X \ dx Jy \ dx Ju 

and 

(t) s (r) -ft) ■ 

vw* xdyJx 

Hence, if the determinant, J, is identically equal to zero, we may say that 

fir) (fO 

\dxjy \oy J x 

ft 1 ) ■(¥) + (£*) ft 1 ) -f^) 

\du J x \dyjy \dx J u \duj x \dy J x 

and, on expansion, this gives 

t) ( t ) s0 - 

d y) x \ dx )u 
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If the first of these two is equal to zero, then u contains only x and, hence, x could be 
expressed in terms of u, giving v as a function of u only. If the second is equal to zero, then 
A contains no ads and, again, v is a function of u only. 

Notes: 

(i) The determinant 


du 

du 

dx 

dy 

dv 

dv 

dx 

dy 


may also be denoted by 


d (u, v ) 
d(x ,y) 


and is called the “Jacobian determinant” or simply the “Jacobian” of u and v with 

respect to x and y. 

(ii) Similar Jacobian determinants may be used to test for the dependence or independence 
of three functions of three variables, four functions of four variables, and so on. 

For example, the three functions 


u = u(x,y,z), v = v(x,y,z) and w = w(x,y,z ) 


are independent if and only if 


j _ d (u, v, w) 
d(x,y,z) 


du 

du 

du 

dx 

dy 

dz 

dv 

dv 

dv 

dx 

dy 

dz 

dw 

dw 

dw 

dx 

dy 

dz: 
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ILLUSTRATIONS 


x + y x — y 

u = and v = 

x v 


are not independent, since 


J = 


y_ i 


= 0 


xy xy 


2 . 


u = x + y and v = x 2 + 2 xy + y 2 
are not independent, since 


J = 


1 

2x + 2 y 


1 

2a: + 2 y 


= 0 . 


3. 


u = x 2 + 2y and v = xy 


are independent, since 


J = 


2x 2 

y X 


= 2a:^ -2y^0. 


4. 


u = x 2 — 2y + z, v = x + 3y 2 — 2z, and w = 5x + y + z 2 
are not independent, since 


J = 


2x -2 1 

1 6 y -2 

5 1 2z 


= 24 xyz + 4a: — 30 y + 4z + 25 ^ 0. 
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14.8.2 EXERCISES 


1. Determine which of the following pairs of functions are independent: 
(a) 

u = xcosy and v = xsmy ; 


(b) 


u = x + y and v = 


y 


x + y 


(d) 


2. Show that 


and 


u = x — 2y and v = x 2 + 4 y 2 — 4 xy + 3x — 6y; 
u = x + 2y and v = x 2 — y 2 + 2 xy — x. 


u = x + y + z, v = x 2 + y 2 + z 2 ~ 2 xy — 2 yz — 2^a: 


tc = x 3 + y 3 + — 3 xyz 


are dependent. 

Show also that w may be expressed as a linear combination of u 3 and uv. 

3. Given that 


x + y + z = u, y + z = uv and z = uvw, 

express x and y in terms of u, v and w. 

Hence, show that 

9(x,y,z) 2 

o? T = u lK 

o{u, V, W) 
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14.8.3 ANSWERS TO EXERCISES 


1. (a) Independent, since J = x; 

(b) Independent, since J = 1 /(x + y)\ 

(c) Dependent, since J = 0; 

(d) Independent, since J = 2 — 2x — Qy. 



3. 


x = u — uv and 


+ 3wn . 


y = uv — uvw. 
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UNIT 14.9 - PARTIAL DIFFERENTIATION 9 


TAYLOR’S SERIES FOR FUNCTIONS OF SEVERAL VARIABLES 
14.9.1 THE THEORY AND FORMULA 

Initially, we shall consider a function, f(x,y ), of two independent variables, x, y, and obtain 
a formula for f{x + h, y + k) in terms of f(x, y) and its partial derivatives. 

Suppose that P,Q and R denote the points with cartesian co-ordinates, ( x,y ), (x + h,y) and 
(x + h, y + k), respectively. 


V 


§ 


R 


p Q ,T 


(a) As we move in a straight line from P to Q , y remains constant so that f(x,y ) behaves 
as a function of x only. 

Hence, by Taylor’s theorem for one independent variable, 


f{x + h,y) 


f( x i y) + fx(x, y) + 


/P 

2 ! 


fxx(x,y) + . . ., 


where f x (x, y) and f xx (x, y ) mean and respectively, with similar notations encountered 
in what follows. 

hr abbreviated notation, 


/(Q) = /( P) + hf,( P) + ^/«(P) + . . 
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(b) As we move in a straight line from Q to R, x remains constant so that f(x,y ) behaves 
as a function of y only. 

Hence, 


k 2 

f(x + h,y + k) = f(x + h, y) + kf x (x + h,y) + ^f xx (x + h, y) + . . 


or, in abbreviated notation, 


/(R) = /(Q) + kf,( Q) + ^/„(Q) + . . 


(c) From the result in (a) 


fy{ Q) 


fy( P) + fc/»*(P) + 


h 2 

2 ! 


fyxx(P ) + • • • 


and 


fyy( Q) 


h 2 

tPi _L_ h f fPi -I 


(d) Substituting the results into (b) gives 


/( R) = /(P) + /h/*(P) + ^(P) + ^ [h 2 f XX { P) + 2hA/ ya; (P) + k 2 fyy( P) 


A • • 


It may be shown that the complete result can be written as 


f(x + h,y + k) = f(x , y) + + k^\ f(x , y)+ 


1 

2 ! 




2 

f(x,y) + 


l 

3 ! 




3 

f(x,y) + • • • 


2 



Notes: 


(i) The equivalent of this result for a function of three variables would be 


f d d d \ 

f{x + h,y + k,z + l) = f(x,y,z ) + I h— + k— + l—j f(x,y,z)+ 


l ( , d d , <9 \ 2 1 /, <9 , d <9 \ 3 

v\ h dx + % +l az) f ( X ' y ’ z) + 3’\ h Vx +k dy +l rz) /(W)+ 

(ii) Alternative versions of Taylor’s theorem may be obtained by interchanging x, y,z... with 
h , k, 

For example, 


f(x + h,y + k) = f(h , k) + (x^ + y^- J f(h, k)+ 


1 ( d d f , 1 ( d d \ 3 , . 

2 l[ X dx +y dy) /(M) + 3! l[ X Vx +y dy) /(M) + ' 


(iii) Replacing x with x — h and y with y — k in (ii) gives the formula, 


f(x,y) = f(h,k ) + r ~ h )^ x + (y~ k ^^y) f( h ’ k ) + 


U {x - h) Vx +{y - k) §u) f{h ’ k) + U {x ~' h) § x + {y - k) i) /( ' a) + 


This is called the “Taylor expansion of f(x,y ) about the point (a, 6)’ 
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(iv) A special case of Taylor’s series (for two independent variables) is obtained by putting 
h — 0 and k — 0 in (ii) to give 


f(x, y) = /(0, 0) + (z A + y A) m 0) + l (z A + J, A) /(0, 0) + . . . 

This is called a “MacLaurin’s series” but is also the Taylor expansion of f(x,y) about 
the point (0, 0). 

EXAMPLE 

Determine the Taylor series expansion of the function f (x + l,y + ^ in ascending powers 
of x and y when 


f(x,y) = sin xy, 


neglecting terms of degree higher than two. 

Solution 

We use the result that 


f (x + l,y+ 7 ^J 






+ 


1 

2 ! 


' d 
x- r + y 

, ox 



f 



+ . . ., 


in which the first term on the right has value v / 3/2. 
The partial derivatives required are as follows: 


Of 7T 7T 

— = ycosxy giving -- at x = 1, y - ; 

ox 6 3 


8*1 

dx 2 


df 

— — = x cos xy giving 
dy 


= —y 2 sin xy giving 


1 

- at x — 


tt 2 \/3 

18 


at 


1, 


7T 



a; = 1, 


7T 
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9 2 f 

dxdy 


cos xy — xy sin xy giving 


1 

2 


a t x = 1 

6 


n 



d 2 f 2 • • • V3 vr 

— = -x sin xy giving at x = 1, y = 


Neglecting terms of degree higher than two, we have 


7 r 1 2 / 1 

sm xy = ) — x -\ — y a: + - 

y 2 6 2 y 36 1 2 


1 vrv^ 


V3 


6 / xy - T® + 


14.9.2 EXERCISES 


1. If f(x,y ) = x 3 — 3 xy 2 , show that 


/( 2 + h, 1 + fc) = 2 + 9h — 12/c + 6 (h 2 — hk — A; 2 ) + h 3 — 3 hk 2 . 


2. If f(x,y ) = sin a: coshy, evaluate all the partial derivatives of f(x,y ) up to order five at 
the point, (a;, y) = (0,0), and, hence, show that 


1 1 

sin a? coshy = x — - ( x 3 — 3 xy 2 ^j + — — (a; 5 — 10a; 3 y 2 + 5 xy A ^j + 


3. If 2 is a function of two independent variables, x and y, where y = z — xsinz, evaluate 
all the partial derivatives of z(x,y ) up to order three at the point, (x,y) = (0,0), and, 
hence, show that 


z(x, y) = y + xy + x 2 y + . . . 
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UNIT 14.10 - PARTIAL DIFFERENTIATION 10 


STATIONARY VALUES FOR FUNCTIONS OF TWO VARIABLES 
14.10.1 INTRODUCTION 

If f(x,y ) is a function of the two independent variables, x and y , then the equation, 


* = f{x,y), 


will normally represent some surface in space, referred to cartesian axes, Ox, O y and O z. 



DEFINITION 1 

The “stationary points”, on a surface whose equation is z = f(x,y), are defined to be the 
points for which 


dz 

dx 


0 and 


dz 

dy 


0 . 


DEFINITION 2 

The function, z = f(x, y), is said to have a “local maximum” at a point, P(x 0 , Vo, zq), if 
is larger than the z co-ordinates of all other points on the surface, with equation z = f(x,y), 
in the neighbourhood of P. 

DEFINITION 3 

The function z = f(x, y) is said to have a “local minimum” at a point, P(xo, yo, z$), if Zo is 
smaller than the z co-ordinates of all other points on the surface, with equation z = f(x,y ), 
in the neighbourhood of P. 
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Note: 

At a stationary point, P(xo, yo, zo), on the surface with equation z = f(x,y), each of the 
planes, x = xq and y = yo, intersect the surface in a curve which has a stationary point at 
P. 

14.10.2 SUFFICIENT CONDITIONS FOR MAXIMA AND MINIMA 

A complete explanation of the conditions for a function, z = f(x, y ), to have a local maximum 
or a local minimum at a particular point require the use of Taylor’s theorem for two variables. 

At this stage, we state the standard set of sufficient conditions without proof. 

(a) Sufficient conditions for a local maximum 

A point, P(To, yo,zo), on the surface with equation z = f(x,y), is a local maximum if 


and 


dz dz 

dx ’ dy 


0, 


d 2 z 
dx 2 


0 


or 


d 2 z 
dy 2 


< 0 


d 2 z d 2 z f d 2 z \ 2 

dx 2 ' dy 2 y dxdy J 


when x = x 0 and y = yo- 

(b) Sufficient conditions for a local minimum 

A point, P(x 0 ,y 0 ,z 0 ), on the surface with equation z = f(x,y), is a local minimum if 


and 


dz dz 

dx ’ dy 


0, 


d 2 z 

dx 2 


or 


d 2 ^ 

dy 2 


> 0 


d 2 z d 2 z ( d 2 zV 

dx 2 ' dy 2 y dxdy J 


2 



when x = Xq and y — y 0 . 

Notes: 


(i) If is positive (or negative) and also > 0, then |4f is automatically 

positive (or negative). 

(ii) If it turns out that is negative at P, we have what is called a “saddle- 

o2 p{2 

point” , irrespective of what and themselves are. 

(iii) The values of z at the local maxima and local minima of the function, z = f(x,y), may 
also be called the “extreme values” of the function, f(x,y). 

EXAMPLES 

1. Determine the extreme values and the co-ordinates of any saddle-points of the function, 

z = x 3 + x 2 — xy + y 2 + 4. 


Solution 

(i) First, we determine jp and 


dz 

dx 


3x 2 + 2x — y and 


dz 

dy 


— x + 2 y. 


(ii) Secondly, we solve the equations = 0 and = 0 for x and y. 


3x 2 + 2 x — y = 0, 

—x + 2 y = 0. 

Substituting equation (2) into equation (1) gives 


That is, 


3a; 2 + 2a; — 



( 1 ) 

( 2 ) 


6a; 2 + 3a; = 0 or 3a;(2a; + 1) = 0. 


Hence, x = 0 or r = - with corresponding values, y — 0, z — 4 and y — z = 
— 1| , respectively. 

The stationary points are thus (0, 0, 4) and |, — 

(iii) Thirdly, we evaluate and at each stationary point. 


d 2 z 

dx 2 


6x + 2, 


c Pz 
dy 2 


2, 


and 


d 2 z 
dxdy 


- 1 . 
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(a) At the point (0,0,4), 


Hence, 


d 2 z 

dx 2 


d Vz 
d y 2 


2, 


and 


d 2 z 

dxdy 


- 1 . 


d 2 ^ (Pz d 2 z d 2 z { d 2 z \ 2 

dx 2 > ’ d y 2 > ’ anC dx 2 ' dy 2 y dxdy J 


3 > 0 


and, therefore, the point, (0,0,4), is a local minimum, with z having a corresponding 
extreme value of 4. 


(b) At the point (-§, f|), 


Hence, 


d 2 z 1 d 2 z n , d 2 z 
xxx> = - 1 , VTW = 2 , and = 

dx z dy z dxdy 


d 2 z 
dx 2 


< 0 , 


8 *z 
dy 2 


> 0 , 


and 


d 2 z c Pz 
dx 2 ’ dy 2 



-3 < 0 


and, therefore, the point, 


(_ 

1 

1 

65 \ 

{ 

2’ 

4’ 

16 J 


, is a saddle-point. 
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2. Determine the stationary points of the function, 


z = 2a; 3 + 6 xy 2 — 3 y 3 — 150a;, 


and determine their nature. 

Solution 

Following the same steps as in the previous example, we have 

dz dz 

— = 6a; 2 + 6 y 2 — 150 and — 1 = 12 xy — 9 y 2 . 
ox oy 

Hence, the stationary points occur where x and y are the solutions of the simultaneous 
equations, 


a; 2 + y 2 = 25, (1) 

y(Ax -3 y) = 0. (2) 


From the second equation, y = 0 or 4a; = 3 y. 

Putting y = 0 in the first equation gives x = ±5 and, with these values of x and y, we 
obtain stationary points at (5,0,-500) and (—5,0,500). 

Putting x y into the first equation gives y = ±4, x = ±3 and, with these values of 
x and y, we obtain stationary points at (3,4, —300) and (—3, —4,300). 

To classify the stationary points we require 


d 2 z 

dx 2 



12x — 18 y, 


and 


d 2 z 

dxdy 


12 y, 


and the conclusions are given in the following table: 


Point 

d 2 z 
dx 2 

d 2 Z 

dy 2 

d 2 z 

dxdy 

d 2 z d 2 z ( a 2 z \ 2 

dx 2 ' dy 2 \ dxdy ) 

Nature 

(5,0,-500) 

60 

60 

0 

positive 

minimum 

(-5,0,500) 

-60 

-60 

0 

positive 

maximum 

(3,4,-300) 

36 

-36 

48 

negative 

saddle-point 

(-3,-4,300) 

-36 

36 

-48 

negative 

saddle-point 


Note: 

The conditions used in the examples above are only sufficient conditions; that is, if the 
conditions are satisfied, we may make a conclusion. But it may be shown that there are 
stationary points which do not satisfy the conditions. 
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Outline proof of the sufficient conditions 

From Taylor’s theorem for two variables, 

f(a+h, b+k)-f(a, b ) = hf x (a, b)+kf y (a, b)+- ( h 2 f xx (a , b) + 2 hkf xy (a, b) + k 2 f yy (a , &))+ . . , 

where h and k are small compared with a and b, f x means 0 f y means 0 f xx means 0, 
f yy means 0 and f xy means J0. 

If f x (a, b) = 0 and f y (a, b) = 0, the conditions for a local minimum at the point (a, b , f(a , b)) 
will be satisfied when the second term on the right-hand side is positive; and the conditions 
for a local maximum at this point are satisfied when the second term on the right is negative. 

We assume, here, that later terms of the Taylor series expansion are negligible. 

Also, it may be shown that a quadratic expression of the form 


Lh 2 + 2 Mhk + Nk 2 


is positive when L > 0 or iV > 0 and LN — M 2 > 0; but negative when L < 0 or N < 0 and 
LN - M 2 > 0. 

If it happens that LN — M 2 < 0, then it may be shown that the quadratic expression may 
take both positive and negative values. 

Finally, replacing L, M and N by f xx (a,b), f yy (a,b ) and f xy (a,b ) respectively, the sufficient 
conditions for local maxima, local minima and saddle-points follow. 
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14.10.3 EXERCISES 


1. Show that the function, 


z = 3a; 3 — y 3 — 4a; + 3 y, 

has a local minimum value when x — y — — 1 and calculate this minimum value. 
What other stationary points are there, and what is their nature ? 

2. Determine the smallest value of the function, 

z = 2x 2 + y 2 — 4a; + 8 y. 

3. Show that the function, 


z = 2 x 2 y 2 + x 2 + Ay 2 — 12 xy, 

has three stationary points and determine their nature. 

4. Investigate the local extreme values of the function, 


z = x 3 + y 3 + 9(x 2 + y 2 ) + 12 xy. 

5. Discuss the stationary points of the following functions and, where possible, determine 
their nature: 


z = x 2 — 2 xy + y 2 ; 


(b) 


z = xy. 


Note: 

It will not be possible to use all of the standard conditions; and a geometrical argument 
will be necessary. 
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14.10.4 ANSWERS TO EXERCISES 


1. — 1, — y) is a local minimum; 

|, 1, 34 j is a local maximum; 

(|, 1, is a saddle-point; 

—1, — is a saddle-point. 

2. The smallest value is —18, since there is a single local minimum at the point (1, —4, —18). 

3. (0, 0, 0) is a saddle-point; 

(2, 1, —8) is a local minimum; (Hint: try x — y |^) 

(—2, —1, —8) is a local minimum. 

4. (0, 0, 0) is a local minimum; 

(—10, —10, 1000) is a local maximum; (Hint: try || — |^) 

(—4, 2, 28) is a saddle-point; 

(2, —4, 28) is a saddle-point. 

5. (a) Points (a, a, 0) are such that = 0 and = 0; but = 0. In fact 

the surface is a “parabolic cylinder” which contains the straight line x = y, z = 0 
and is symmetrical about the plane x = y. 

(b) (0, 0, 0) is a saddle-point since z may have both positive and negative values in the 
neighbourhood of this point. 
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UNIT 14.11 - PARTIAL DIFFERENTIATION 11 


CONSTRAINED MAXIMA AND MINIMA 

Having discussed the determination of local maxima and local minima for a function, f(x, y , ...), 
of several independent variables, we shall now consider that an additional constraint is im- 
posed in the form of a relationship, g(x,y , ...) = 0. 

This would occur, for example, if we wished to construct a container with the largest possible 
volume for a fixed value of the surface area. 

14.11.1 THE SUBSTITUTION METHOD 

The following examples illustrate a technique which may be used in elementary cases: 

EXAMPLES 

1. Determine any local maxima or local minima of the function, 

f(x,y) = 3x 2 + 2 y 2 , 

subject to the constraint that x + 2y — 1 = 0. 

Solution 

In this kind of example, it is possible to eliminate either x or y by using the constraint. 

If we eliminate x, for instance, we may write f(x,y) as a function, F(y ), of y only. 

In fact, 


f{x, y) = F{y ) = 3(1 - 2 y) 2 + 2y 2 = 3 - 12 y + Uy 2 . 

Using the principles of maxima and minima for functions of a single independent vari- 
able, we have 


F\y) = 282/ — 12 and F" = 28 

and, hence, a local minimum occurs when y = 3/7 and hence, x = 1/7. 
The corresponding local minimum value of f{x,y) is 


3 




21 

49 


3 

7' 


1 



2. Determine any local maxima or local minima of the function, 

f(x,y,z ) = x 2 + y 2 + z 2 , 

subject to the constraint that x + 2y + 3z — 1. 

Solution 

Eliminating x, we may write f(x,y,z ) as a function, F(y,z), of y and z only. 
In fact, 

f(x, y, z) = F(y , z) = (1 - 2y - 3z) 2 + y 2 + £ 2 . 

That is, 

F(r/, z) = 1 — 4r/ — 6z + 12?/^ + 5 y 2 + 10^ 2 . 


Using the principles of maxima and minima for functions of two independent variables 
we have, 

dF , <9F 

— — = — 4 + 12z + 10?/ and — — = — 6 + 12 y + 20^, 
ay oz 

and a stationary value will occur when these are both equal to zero. 

Thus, 


5y + 6z = 2, 
6 y + 10 2 = 3, 


which give y = 1/7 and z = 3/14, on solving simultaneously. 

The corresponding value of x is 1/14, which gives a stationary value, for f(x,y,z), of 

14/(14) 2 = 4p 

Also, we have 


d 2 F 

dy 2 


10 > 0 , 


d 2 F 
dz 2 


20 > 0 , 


and 


d 2 F 

dydz 


which means that 


d 2 F d 2 F / d 2 F \ 2 
dy 2 ’ dz 2 \dydz ) 


200 - 144 > 0. 
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Hence there is a local minimum value, of x 2 + y 2 + z 2 , subject to the constraint that 
x + 2y + 3z — 1, at the point where 


x = 


1 

14’ 


y = -, and 2 


3 

14 


Note: 

Geometrically, this example is calculating the square of the shortest distance from the 
origin onto the plane whose equation is x + 2y + 3z = 1. 

14.11.2 THE METHOD OF LAGRANGE MULTIPLIERS 

In determining the local maxima and local minima of a function, f(x,y, ...), subject to the 
constraint that g(x,y, ...) = 0, it may be inconvenient (or even impossible) to eliminate one 
of the variables, x,y, .... 

An alternative method may be illustrated by means of the following steps for a function of 
two independent variables: 

(a) Suppose that the function, 2 = f(x,y), is subject to the constraint that g(x,y ) = 0. 

Then, since z is effectively a function of x only, its stationary values will be determined by 
the equation 


(b) From Unit 14.5 (Exercise 2), the total derivative of z = f{x,y) with respect to x, when 
x and y are not independent of each other, is given by the formula, 

dz df df d y 

dx dx dy da; 

(c) From the constraint that g(x,y ) = 0, the process used in (b) gives 

dg_ dg_ dy = Q 
dx dy da; 

and, hence, for all points on the surface with equation, g(x,y ) = 0, 
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dy 

da; 


dg 
dx 
dg ' 
dy 


Thus, throughout the surface with equation, g(x,y ) = 0, 

= df_ _ ( df \ (If) 

da; dx \dy ) ' 


(d) Stationary values of z, subject to the constraint that g(x,y ) = 0, will, therefore, occur 
when 


df_ dg_ _ df_ dg_ = Q 

dx dy dy dx 

But this may be interpreted as the condition that the two equations, 


df . dg 


0 and 


df . dg 


o, 


should have a common solution for A. 
(e) Suppose that 


4>{x, V, A) = }{x, y) + \g{x, y). 

Then <f>(x, y, A) would have stationary values whenever its first order partial derivatives with 
respect to x, y and A were equal to zero. 

In other words, 


df dg df dg 

b — ^ — 0, f) — ^ A — 

dx dx dy dy 


0, and g(x,y) 


0 . 
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Conclusion 


The stationary values of the function, z = f(x,y), subject to the constraint that g(x,y ) = 0, 
occur at the points for which the function 


y, A) = f(x, y) + A g(x, y) 


has stationary values. 

The number, A, is called a “Lagrange multiplier”. 

Notes: 

(i) In order to determine the nature of the stationary values of z, it will usually be necessary 
to examine the geometrical conditions in the neighbourhood of the stationary points. 

(ii) The Lagrange multiplier method may also be applied to functions of three or more 
independent variables. 

EXAMPLES 

1. Determine any local maxima or local minima of the function, 

z = 3x 2 + 2 y 2 , 

subject to the constraint that x + 2y — 1 = 0. 

Solution 
Firstly, we write 


4>(x, y, A) = 3a: 2 + 2 y 2 + A (a; + 2 y — 1). 

Then, 

dd> . d(b . , d(p 

— = 6a: + A, — = Ay + 2A and — = x + 2y - 1. 

ox oy o\ 

The third of these is already equal to zero; but we equate the first two to zero, giving 


6a: + A = 
2y + A = 


5 


0, 

0 . 



Eliminating A shows that 6a; — 2 y = 0, or y = 3a;; and, if we substitute this into the 
constraint, we obtain 7x — 1 = 0. 

Hence, 


1 



y — - and A 
7 


6 

7 


A single stationary point therefore occurs at the point where 


x = 


1 

r 



and z 



3 

7' 


Finally, the geometrical conditions imply that the stationary value of z occurs at a point 
on the straight line whose equation is x + 2y — 1 = 0. 



The stationary point is, in fact, a minimum value of z, since the function, 3a; 2 + 2 y 2 , 
has values larger than 3/7 ~ 0.429 at any point either side of the point, (1/7, 3/7) = 
(0.14, 0.43), on the line whose equation is x + 2y — 1 = 0. 

For example, at the points, 0.12,0.44) and (0.16,0.42), on the line, the values of z are 
0.4304 and 0.4296, respectively. 

2. Determine the maximum and minimum values of the function, z = 3a; + 4 y, subject to 
the constraint that x 2 + y 2 = 1. 

Solution 

Firstly, we write 


4>(x, y, A) = 3a; + 4 y + A(x 2 + y 2 — 1). 
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Then, 


86 s 86 , , 86 n 9 

— = 3 + 2Xx, — = 4 + 2Xy and — = x 2 + y 2 - 1. 

cte ay aX 


The third of these is already equal to zero; but we equate the first two to zero, giving 


3 + 2Xx = 0, 
2 + Xy = 0. 


Thus, 


x = and y = 

2X u 

which we may substitute into the constraint to give 


2 

A’ 


9 4 

4P + P 


That is, 


5 

9 + 16 = 4A 2 and hence A = ±-. 

We may deduce that x — ±| and y = ±|, giving stationary values, ±5, of z. 

Finally, the geometrical conditions suggest that we consider a straight line with equation 
3a: + Ay = c (a constant) moving across the circle with equation x 2 + y 2 = 1. 
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The further the straight line is from the origin, the greater is the value of the constant, 
c. 

The maximum and minimum values of 3 x+4y, subject to the constraint that x 2 +y 2 = 1 
will occur where the straight line touches the circle; and we have shown that these are 
the points, (3/5, 4/5) and (—3/5,— 4/5). 

3. Determine any local maxima or local minima of the function, 

w = x + y + z , 

subject to the constraint that x + 2y + 3z = 1. 

Solution 
Firstly, we write 


4>(x, y, z, A) = x 2 + y 2 + z 2 + A (a; + 2y + 3z — 1). 


Then, 


dd) , dd) , d(b . , dd) 

— = 2x + A, — = 2y + 2A, — = 2^ + 3A and — = x + 2y + 3z — 1. 
or ay oz a A 


The fourth of these is already equal to zero; but we equate the first three to zero, giving 


2x + A = 0, 
V + A = 0, 
2z + 3A = 0. 
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Eliminating A shows that 2x — y = 0, or y = 2x, and 6x — 2z = 0, or z = 3a;. 
Substituting these into the constraint gives 14a; = 1. 

Hence, 


x = 


y = 


z = 


and A = 


14 7 7 14 

A single stationary point therefore occurs at the point where 


x= y — z = 7 — and w — ( — 

A A ' & T ' 1 A \ A A 


14 


7 


14 


14 


1 

+ l ,7 


3 Y 

+ l Ii) 


1 

14' 


Finally, the geometrical conditions imply that the stationary value of w occurs at a 
point on the plane whose equation is x + 2y + 2>z = 1. 



The stationary point must give a minimum value of w since the function, x 2 + y 2 + z 2 , 
represents the square of the distance of a point, (x,y,z), from the origin; and, if the 
point is constrained to lie on a plane, this distance is bound to have a minimum value. 


14.11.3 EXERCISES 


1. In the following exercises, use both the substitution method and the Lagrange multiplier 
method: 

(a) Determine the minimum value of the function, 

z = x + y , 

subject to the constraint that x + y = 1. 

(b) Determine the maximum value of the function, 


* = xy, 


subject to the constraint that x + y = 15. 
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(c) Determine the maximum value of the function, 

z = x 2 + 3 xy — 5y 2 , 

subject to the constraint that 2x + 3 y = 6. 

2. In the following exercises, use the Lagrange multiplier method: 

(a) Determine the maximum and minimum values of the function, 

w = x — 2y + 5 z, 

subject to the constraint that x 2 + y 2 + z 2 = 30. 

(b) If x > 0, y > 0 and z > 0, determine the maximum value of the function, 

w = xyz, 

subject to the constraint that x + y + z 2 = 16. 

(c) Determine the maximum value of the function, 

w = 8a; 2 + 4 yz — 16 z + 600, 
subject to the constraint that 4a; 2 + y 2 + 4 z 2 = 16. 

14.11.4 ANSWERS TO EXERCISES 


1. (a) The minimum value is z = 1/2, and occurs when x = y = 1/2; 

(b) The maximum value is z ~ 56.25, and occurs when x = y = 15/2; 

(c) The maximum value is z = 9, and occurs when x = 3 and y — 0. 


2. (a) The maximum value is 30, and occurs when x — 1, y — —2 and z — 5; 

The minimum value is —30, and occurs when x = —1, y — 2 and z = —5; 


(b) The maximum value is 


4096 

25\/E 


73.27, 


and occurs when x = 32 / x/5 , y = 32/ a/ 5 and z = 4/\/5; 

(c) The maximum value is approximately 613.86, and occurs when x — 0, y — —2 and 


10 



“JUST THE MATHS” 


UNIT NUMBER 
14.12 

PARTIAL DIFFERENTIATION 12 
(The principle of least squares) 

by 

A. J. Hobson 


14.12.1 The normal equations 

14.12.2 Simplified calculation of regression lines 

14.12.3 Exercises 

14.12.4 Answers to exercises 




UNIT 14.12 - PARTIAL DIFFERENTIATION 12 


THE PRINCIPLE OF LEAST SQUARES 
14.12.1 THE NORMAL EQUATIONS 

Suppose two variables, x and y, are known to obey a “straight line law”, of the form 
y = a + bx, where a and b are constants to be found. 

Suppose also that, in an experiment to test this law, we obtain n pairs of values, (aq,//*), 
where i = 1,2,3 

If the values Xi are assigned values, they are likely to be free from error, whereas the 
observed values, //*, will be subject to experimental error. 

The principle underlying the straight line of “best fit” is that, in its most likely position, 
the sum of the squares of the //-deviations, from the line, of all observed points is a minimum. 

The Calculation 

The //-deviation, e*, of the point, (. x^yi ), is given by 

e* = yi — ( a + bxi). 

Hence, 

n n 

J2 = ^ - (° + bxi] 2 = P say. 

i— 1 i = 1 


Regarding P as a function of a and b, it will be a minimum when 


dP dP d 2 P d 2 P , d 2 P d 2 P 

A“ = 0 > AT = 0 ’ > 0 or ATT > °’ and AT ATT 

oa ob oa, ob 2 oa z ob z 


(d 2 p\ 2 
l dadb J 


For these conditions, we have 


dP n dP n 

-x- = “2 J2 iVi ~ ( a + bx i\ and AT = ~ 2 ^2 x ihJi + bx *h 

<xa i=l O' 3 i= 1 
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and these will be zero when 


[Vi ~ ( a + hx i\ = 0 (!) 

2—1 


and 

n 

Yj Xi[yi + bxi] = 0 (2). 

2=1 

From (1), 

n n n 

Y Vi~Y a ~Y bxi = o. 

2=1 2=1 2=1 

That is, 

n n 

Y Ui = na + t>Y X i ( 3 )‘ 

2=1 2=1 

From (2), 

n n n 

Y XiVi - (i Y • r ' ~ b Y x i ( 4 )- 

2=1 2=1 2=1 

The statements (3) and (4) are two simultaneous equations which may be solved for a and 
b. 

They are called the “normal equations” 

A simpler notation for the normal equations is 


E y = na + x; 


E xy = oEx + 6E x 2 . 
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By eliminating a and b in turn, we obtain the solutions 


E a; 2 .E y — Erc.E xy 
n E x 2 - (E xf 


and b 


nTj xy — E x.E y 
nE a: 2 - (E x) 2 ' 


With these values of a and 6, the straight line with equation, y = a + bx, is called the 

“regression line of 7/ on a;”. 

Note: 

To verify that the //-deviations from the regression line have indeed been minimised, we also 
need the results that 


d 2 P " d 2 P ™ o d 2 P " 

= E 2 = ^ = E 2a h> and = E 2 *i 


(9a 2 


i=l 




dadb 


i = 1 


The hrst two of these are clearly positive; and it may be shown that 


d 2 P d 2 P (d 2 P\ 
da? ' db 2 1 dadb ) 


> 0 . 


EXAMPLE 

Determine the equation of the regression line of y on x for the following data, which shows 
the Packed Cell Volume, xmm, and the Red Blood Cell Count, y millions, of 10 dogs: 


X 

45 

42 

56 

48 

42 

35 

58 

40 

39 

50 

y 

6.53 

6.30 

9.52 

7.50 

6.99 

5.90 

9.49 

6.20 

6.55 

8.72 
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Solution 


X 

y 

xy 

x 2 

45 

6.53 

293.85 

2025 

42 

6.30 

264.60 

1764 

56 

9.52 

533.12 

3136 

48 

7.50 

360.00 

2304 

42 

6.99 

293.58 

1764 

35 

5.90 

206.50 

1225 

58 

9.49 

550.42 

3364 

40 

6.20 

248.00 

1600 

39 

6.55 

255.45 

1521 

50 

8.72 

436.00 

2500 

455 

73.70 

3441.52 

21203 


The regression line of y on x thus has equation y = a + bx where 


(21203) (73.70) - (455) (3441.52) 
(10)(21203) - (455) 2 


~ -0.645 


and 


_ (10) (3441.52) - (455) (73.70) 
(10)21203) - (455) 2 


Thus, 


y = 0.176a; -0.645 
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14.12.2 SIMPLIFIED CALCULATION OF REGRESSION LINES 


A simpler method of determining the regression line of y on x for a given set of data, is to 
consider a temporary change of origin to the point ( x,y ), where x is the arithmetic mean of 
the values Xi and y is the arithmetic mean of the values y l . 

RESULT 

The regression line of y on x contains the point (x,y). 

Proof: 

From the first of the normal equations, 


Sy 

n 


a + b 


E x 


n 


That is, 


y — a + bx. 


A change of origin to the point ( x,y ), with new variables X and Y, is associated with the 
formulae 


X = x — x and Y = y — y 

and, in this system of reference, the regression line will pass through the origin. 
Its equation is therefore 


Y = BX, 


where 


nE XY - E XE Y 
nE X 2 - (E X ) 2 
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However, 


E X — E (x — x) = E x — E x = nx — nx = 0 


and 


Ey = E (y — y) — Y, y — Y, y — ny — ny — 0. 


Thus, 


B 


s xy 

E X 2 ' 


Note: 

In a given problem, we make a table of values of Xi, yi, X l: Y t} XiYi and Xf. 
The regression line is then 


y — y — B(x — x) or y — BX + (y — Bx)] 

though, there may be slight differences in the result obtained compared with that from the 
earlier method. 

EXAMPLE 

Determine the equation of the regression line of y on x for the following data which shows 
the Packed Cell Volume, xmm, and the Red Blood Cell Count, y millions, of 10 dogs: 


X 

45 

42 

56 

48 

42 

35 

58 

40 

39 

50 

y 

6.53 

6.30 

9.52 

7.50 

6.99 

5.90 

9.49 

6.20 

6.55 

8.72 


Solution 

The arithmetic mean of the x values is x = 45.5 
The arithmetic mean of the y values is y — 7.37 
This gives the following table: 
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X 

y 

X = x — x 

Y — y — y 

XY 

X' 2 

45 

6.53 

-0.5 

-0.84 

0.42 

0.25 

42 

6.30 

-3.5 

-1.07 

3.745 

12.25 

56 

9.52 

10.5 

2.15 

22.575 

110.25 

48 

7.50 

2.5 

0.13 

0.325 

6.25 

42 

6.99 

-3.5 

-0.38 

1.33 

12.25 

35 

5.90 

-10.5 

-1.47 

15.435 

110.25 

58 

9.49 

12.5 

2.12 

26.5 

156.25 

40 

6.20 

-5.5 

-1.17 

6.435 

30.25 

39 

6.55 

-6.5 

-0.82 

5.33 

42.25 

50 

8.72 

4.5 

1.35 

6.075 

20.25 

455 

73.70 



88.17 

500.5 


Hence, 


B 


88.17 

500.5 


~ 0.176 


and so the regression line has equation 

y = 0.176a; + (7.37 - 0.176 x 45.5). 


That is, 


y = 0.176a; -0.638 


14.12.3 EXERCISES 

1. For the following tables, determine the regression line of y on x, assuming that y = a+bx. 
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X 

l 

3 

5 

10 

12 

y 

58 

55 

40 

37 

22 


2. To determine the relation between the normal stress and the shear resistance of soil, a 
shear-box experiment was performed, giving the following results: 


Normal Stress, x p.s.i. 

11 

13 

15 

17 

19 

21 

Shear Stress, y p.s.i. 

15.2 

17.7 

19.3 

21.5 

23.9 

25.4 


If y — a + bx, determine the regression line of y on x. 

3. Fuel consumption, y miles per gallon, at speeds of x miles per hour, is given by the 
following table: 


X 

20 

30 

40 

50 

60 

70 

80 

90 

y 

18.3 

18.8 

19.1 

19.3 

19.5 

19.7 

19.8 

20.0 


Assuming that 


b 

y = a+- , 
x 


determine the most probable values of a and b. 


14.12.4 ANSWERS TO EXERCISES 


1. (a) 

(b) 


y = 6.46 — 3.52a:; 


y = 54.20 + 0.52a;; 


y = 60.78 - 2.97a;. 


2 . 


y = 4.09 + 1.03a;. 


3 . 


a ~ —42 and b ~ 20. 
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UNIT 15.1 - ORDINARY DIFFERENTIAL EQUATIONS 1 
FIRST ORDER EQUATIONS (A) 

15.1.1 INTRODUCTION AND DEFINITIONS 


1. An ordinary differential equation is a relationship between an independent variable 
(such asi), a dependent variable (such as y ) and one or more ordinary derivatives of y 
with respect to x. 

There is no discussion, in Units 15, of partial differential equations, which involve 
partial derivatives (see Units 14). Hence, in what follows, we shall refer simply to 
“differential equations” . 

For example, 


dy -2x dy 2 dy n , dy 

— = xe , x— — y, x — f- y sm x — 0 and — 
da; da; da; da; 


x + y 
x-y 


are differential equations. 

2. The “order” of a differential equation is the order of the highest derivative which 
appears in it. 

3. The “general solution” of a differential equation is the most general algebraic relation- 
ship between the dependent and independent variables which satisfies the differential 
equation. 

Such a solution will not contain any derivatives; but we shall see that it will contain one 
or more arbitrary constants (the number of these constants being equal to the order of 
the equation). The solution need not be an explicit formula for one of the variables in 
terms of the other. 

4. A “boundary condition” is a numerical condition which must be obeyed by the 
solution. It usually amounts to the substitution of particular values of the dependent 
and independent variables into the general solution. 

5. An “initial condition” is a boundary condition in which the independent variable 
takes the value zero. 

6. A “particular solution” (or “particular integral”) is a solution which contains no 
arbitrary constants. 

Particular solutions are usually the result of appplying a boundary condition to a general 
solution. 
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15.1.2 EXACT EQUATIONS 

The simplest kind of differential equation of the first order is one which has the form 


dy 

da; 


= /(+)• 


It is an elementary example of an “exact differential equation” because, to find its 
solution, all that it is necessary to do is integrate both sides with respect to x. 

In other cases of exact differential equations, the terms which are not just functions of the 
independent variable only, need to be recognised as the exact derivative with respect to x of 
some known function (possibly involving both of the variables). 

The method will be illustrated by examples. 

EXAMPLES 


1. Solve the differential equation 

^ = 3a; 2 — 6a; + 5, 
da; 

subject to the boundary condition that y — 2 when x — 1. 

Solution 

By direct integration, the general solution is 

y = x 3 — 3x 2 + 5 x + C, 

where C is an arbitrary constant. 

From the boundary condition, 

2 = 1 — 3 + 5 + C, so that C = -1. 

Thus the particular solution obeying the given boundary condition is 

y = x 3 — 3x 2 + 5a; — 1. 
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2. Solve the differential equation 


x— +y = x , 
da; 

subject to the boundary condition that y = 4 when x = 2. 

Solution 

The left hand side of the differential equation may be recognised as the exact derivative 
with respect to x of the function xy. 

Hence, we may write 

d . . 3 

ii [xv) = x] 

and, by direct integration, this gives 

4 

X 

xy = -j + C, 

where C is an arbitrary constant. 

That is, 


x 3 C 
V — yr S • 

4 x 

Applying the boundary condition, 

4 — 2 + — 

2 ’ 

which implies that C — 4 and the particular solution is 


V = 



3. Determine the general solution to the differential equation 


sin x + sin y + x cos y 


dy 

da; 


0 . 


Solution 

The second and third terms on the right hand side may be recognised as the exact 
derivative of the function x sin y; and, hence, we may write 
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u. , . 

sm re + —(re sin y) = 0. 
arc 


By direct integration, we obtain 


— cos re + re sin y = C, 


where C is an arbitrary constant. 

This result counts as the general solution without further modification; but an explicit 
formula for y in terms of re may, in this case, be written in the form 


y 


Sin - 1 


C + cos re 
re 


15.1.3 THE METHOD OF SEPARATION OF THE VARIABLES 

The method of this section relates to differential equations of the first order which may be 
written in the form 


P(v)j L = Q(x). 
dre 

Integrating both sides with respect to re gives 

/ = I *2 0*0 dre. 

But, from the formula for integration by substitution in Units 12.3 and 12.4, this simplifies 
to 


j P{y ) d y = J Q{x ) dre. 


Note: 

The way to remember this result is to treat dre and d y, in the given differential equation, as 
if they were separate numbers; then rearrange the equation so that one side contains only 
y while the other side contains only re; that is, we separate the variables. The process is 
completed by putting an integral sign in front of each side. 
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EXAMPLES 


1. Solve the differential equation 


d y 

x t~ = y> 

ax 


subject to the boundary condition that y — 6 when x — 2. 

Solution 

The differential equation may be rearranged as 


ldy = 1 

ydx x ’ 

and, hence, 



giving 


In y = In x + C. 

Applying the boundary condition, 


In 6 = In 2 + C, 


so that 

C = In 6 — In 2 = In = In 3. 
The particular solution is therefore 

In y — In x T In 3 or y — 3x. 


Note: 

In a general solution where most of the terms are logarithms, the calculation can be 
made simpler by regarding the arbitrary constant itself as a logarithm, calling it In A, 
for instance, rather than C. In the above example, we would then write 

In y ~ In x + In A simplifying to y — Ax. 

On applying the boundary condition, 6 = 2A, so that A = 3 and the particular solution 
is the same as before. 
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2. Solve the differential equation 

oc{4: - x )^-~ y = 0, 

subject to the boundary condition that y — 7 when x = 2. 

Solution 

The differential equation may be rearranged as 


Id y = 1 

y da; x(4 — x) 

Hence, 

/>-/ 

or, using the theory of partial fractions, 


x(4 — x) 


da;; 


r 1 r 

- 1 i - 

~d y = 

4 | 4 

1 y J 

x 4 — x 


da;. 


The general solution is therefore 


In y — - In x — — ln(4 — x) + In A 


or 


y = A 


X \ 4 

4 — x 


Applying the boundary condition, 7 = A, so that the particular solution is 

y = 7 ( 


X \ 4 


4 — x 
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15.1.4 EXERCISES 


1. Determine the general solution of the differential equation 


dy 

da; 


a; 5 + 3e~ 2x . 


2. Given that differential equation 

2 d V , o 

x - — b 2xy = sm x 
ax 

is exact, determine its general solution. 

3. Given that the differential equation 


. i 2 o 

tana;- — f- y sec x = cos 2x 
ax 

is exact, determine the particular solution for which y = 1 when x — |. 

4. Use the method of separation of the variables to determine the general solution of each 
of the following differential equations: 


(a) 

(b) 


da; 

— = (a; - l)(x + 2); 
x(y-3)p- = 4 y. 


5. Use the method of separation of the variables to solve the following differential equations 
subject to the given boundary condition: 

(a) 

+ =x2y) 

where y = 2 when x — 1; 

(b) 


where y = 0 when x = 0. 


x 3 + (y + l) 2 


dy 

da; 


0, 
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15.1.5 ANSWERS TO EXERCISES 

1. 

x 6 3e~ 2x „ 

y= +C. 

y 6 2 

2. 

C — cos x 

y ~ X 2 ' 

3. 

3 2 

y — - cot x — cos x. 
y 2 

4. (a) 


(b) 

y = In [Ax 4 y 3 ]. 

5. (a) 

y 3 = 4(1 + x 3 )] 

(b) 

4[1 ~ (y + l) 3 ] = 3a: 4 
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UNIT 15.2 - ORDINARY DIFFERENTIAL EQUATIONS 2 
FIRST ORDER EQUATIONS (B) 

15.2.1 HOMOGENEOUS EQUATIONS 

A differential equation of the first order is said to be “homogeneous” if, on replacing x 
by \x and y by A y in all the parts of the equation except 4^, A may be removed from the 
equation by cancelling a common factor of A n , for some integer n. 

Note: 

Some examples of homogeneous equations would be 

(x + y)^- + (4x — y) = 0, and 2 xy*y- + ( x 2 + y 2 ) = 0, 

LI 'Aj U.ih 

where, from the first of these, a factor of A could be cancelled and, from the second, a factor 
of A 2 could be cancelled. 

15.2.2 THE STANDARD METHOD 

It turns out that the substitution 


y = vx 


( . . d y 

I givnig — 


v + x 
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always converts a homogeneous differential equation into one in which the variables can be 
separated. The method will be illustrated by examples. 

EXAMPLES 


1. Solve the differential equation 

d y . 0 

x— = x + 2y , 
da; 

subject to the condition that y — 6 when x = 6. 

Solution 

if y = vx , then ^ = v + x 4^, so that the differential equation becomes 


x 


+ X 



x + 2vx. 


1 




That is, 


dv 

v + x— — 1 + 2v 
ax 


or 


dv 

x— = 1 + v. 
ax 


On separating the variables, 


J 1 + v 


dv 


— da;, 
x 


giving 


ln(l + v) = In a; + In A, 


where A is an arbitrary constant. 

An alternative form of this solution, without logarithms, is 


Ax = 1 + v 

and, substituting back v = -, the solution becomes 

Ax = 1 + — 
x 

or 

y = Ax 2 — x. 

Finally, if y — 6 when x = 1, we have 6 = A — 1 and, hence, A = 7. 
The required particular solution is thus 

y = 7x 2 — x. 

2. Determine the general solution of the differential equation 

dv 

(x + y)- r + (Ax-y) = 0. 
da; 
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Solution 

If y — vx , then ^ = v + x^, so that the differential equation becomes 


(x + vx) 


+ x 



+ (4x 


vx) = 0. 


That is, 


(1 + v) 


+ X 



+ (4 - v) = 0 


or 


v + x 


dv 

dx 


v — 4 
v + 1 ’ 


On further rearrangement, we obtain 


dv v — 4 —4 — v 2 

x— = —v — — ; 

dx v + 1 v + 1 


and, on separating the variables, 


v + 1 
4 + u 2 


dv 


— dx 
x 


or 


Hence, 


1 r r 2v 2 

2 J A + v 2 4 + v 2 . 


- dx. 

x 


1 ' 
2 . 


ln(4 + v 2 ) + tan 


-i 


v 

2 


— In x + C, 


where C is an arbitrary constant. 

Substituting back v — -, gives the general solution 


1 

2 


In 4 + 



+ tan 1 



— In x + C. 
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3. Determine the general solution of the differential equation 

2 xy^- + (x 2 + y 2 ) = 0. 
da; 

Solution 

If y — vx, then ^ = v + x^, so that the differential equation becomes 


That is, 


or 


2vx 2 (v + x \ + (x 2 + v 2 x 2 ) = 0. 


2v (v + + (1 + v 2 ) = 0 


2vx^- = — (1 + 3v 2 ). 
ax 


On separating the variables, we obtain 

r 2v 


1 + 3v 2 


dx = — — da;, 


x 


which gives 


ln(l + 3v 2 ) = — In a; + In A, 


where A is an arbitrary constant. 
Hence, 


or, on substituting back v — -, 


(1 + 3v 2 )s = 


A 


x 


2 _L q,,2\ 3 


x + 3 y 
x 2 


= Ax, 


which can be written 


x 2 + 3 y 2 = Bx 5 , 


where B = A 3 . 
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15.2.3 EXERCISES 


Use the substitution y = vx to solve the following differential equations subject to the given 
boundary condition: 


1. 


where y = 3 when x = —2. 

2 . 


where y = 5 when x — 0. 

3. 


where y = 1 when x — 2. 

4. 


(2 y - x) 


dy 

dx 


2x + y, 


/ 2 2 \ C ^ 

(•£' -y )-j^ = xy, 


x 3 + y 3 


3 xy 2 


d y 

dx’ 


x(x 2 + y 2 )j^ = 2 y 3 , 


where y — 2 when x — 1. 

5. 



(y + \Jx 2 -y 2 ) = 0, 


where y = 0 when x = 1. 
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15.2.4 ANSWERS TO EXERCISES 


1 . 


2 . 


3 . 

4 . 

5 . 


y 2 — xy — x 2 = 11 . 


y — be zy 7 . 
x 3 - 2 y 3 = 3x. 


3 x 2 y = 2(y 2 — x 2 ). 




= x. 
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UNIT 15.3 - ORDINARY DIFFERENTIAL EQUATIONS 3 
FIRST ORDER EQUATIONS (C) 

15.3.1 LINEAR EQUATIONS 

For certain kinds of first order differential equation, it is possible to multiply the equation 
throughout by a suitable factor which converts it into an exact differential equation. 

For instance, the equation 


dy 

da; 


J- o 

H — y = x 
x 


may be multiplied throughout by x to give 


dy v 3 

x- — V y = x . 
da; 


It may now be written 


_d_ 

da; 


{xy) 


= x 


3 


and, hence, it has general solution 


x 


xy = -j + C, 


where C is an arbitrary constant. 

Notes: 

(i) The factor, x which has multiplied both sides of the differential equation serves as an 
“integrating factor” , but such factors cannot always be found by inspection. 

(ii) In the discussion which follows, we shall develop a formula for determining integrating 
factors, in general, for what are known as “linear differential equations” . 
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DEFINITION 


A differential equation of the form 


d y 

da; 


+ P(x)y = Q(x) 


is said to be “linear” . 

RESULT 

Given the linear differential equation 

!r + P(x)y = Q(x), 


the function 


e f p (x) da; 


is always an integrating factor; and, on multiplying the differential equation throughout by 
this factor, its left hand side becomes 


d 

da; 


y x ef p(x) dx . 


Proof 

Suppose that the function, R(x), is an integrating factor; then, in the equation 

R(x)— + R(x)P(x)y = R(x)Q(x), 
dx 

the left hand side must be the exact derivative of some function of x. 

Using the formula for differentiating the product of two functions of x, we can make it the 
derivative of R(x)y provided we can arrange that 

R(x)P(x) = -^[R(a;)]. 
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But this requirement can be interpreted as a differential equation in which the variables R(x) 
and x may be separated as follows: 


/ 


W) ,m(x) = / P(x) 


Hence, 

\nR(x) = I P(x) dx. 

That is, 

R(x) = el p ^ x) dx , 

as required. 

The solution is obtained by integrating the formula 

-r~[y x R(x)] = R(x)P(x). 


Note: 

There is no need to include an arbitrary constant, C, when P(x) is integrated, since it 
would only serve to introduce a constant factor of e c in the above result, which would then 
immediately cancel out on multiplying the differential equation by R(x). 

EXAMPLES 


1. Determine the general solution of the differential equation 


dy 

da; 


-L o 

H — y = x . 
x 


Solution 

An integrating factor is 


e f ^ dx = e lnx 


3 


= x. 



On multiplying throughout by the integrating factor, we obtain 


and so, 





where C is an arbitrary constant. 

2. Determine the general solution of the differential equation 


Solution 

An integrating factor is 


dy 

da; 


+ 2 xy 


2e~ x 


2 


e f 2 % _ e x 2 


Hence, 


giving 




2, 


ye° 2 = 2 x + C, 


where C is an arbitrary constant. 


15.3.2 BERNOUILLI’S EQUATION 

A similar type of differential equation to that in the previous section has the form 

;! y + P(x)y = Q(x)y n . 
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It is called “Bernouilli’s Equation” and may be converted to a linear differential equation 
by making the substitution 


z = 


y 


1 —n 


Proof 

The differential equation may be rewritten as 


y- nC ^ c + P(x)y 1 ~ n = Q(x) 


Also, 


d z 

da; 


(1 - n)y n 


d y 

da; 


Hence the differential equation becomes 


1 dz 
1 — n da; 


P{x)z = Q(x). 


That is, 


— + (1 - n)P(x)z = (1 - n)Q(x), 
dx 

which is a linear differential equation. 

Note: 

It is better not to regard this as a standard formula, but to apply the method of obtaining 
it in the case of particular examples. 

EXAMPLES 


1. Determine the general solution of the differential equation 


dy 3 -a; 2 

xy- — = y e 
dx 
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Solution 

The differential equation may be rewritten 


.-3 d v 


-2 -x* 


-y — + x.y =e~ 
dx 


Substituting z = y 2 , we obtain ^ = —2 y 3 ^ and, hence, 


dx & dx 

ldz 


2 da; 


+ xz = e 


or 


^ + 2xz = 2e~ x2 . 
dx 


An integrating factor for this equation is 


3 J 2x dx x 2 


Thus, 


da; 


(. ze x 2 ) 


= 2 , 


giving 


ze x2 = 2 x + C, 


where C is an arbitrary constant. 
Finally, replacing z by y~ 2 , 


y 2 x + C' 

2. Determine the general solution of the differential equation 


fly + y 

da; x 


= xy 2 . 
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Solution 

The differential equation may be rewritten 



1 

+ ~-y 

X 


-1 


= X. 


On substituting z = y 1 we obtain ^ = —y 2 qy so that 

d;; 1 

— h -.2 = X 

da; x 
or 

dz 1 

.z = —x. 

da; x 

An integrating factor for this equation is 


e f (~s) dx = e ~ lnx = I. 

x 


Hence, 



giving 

— — — x + C, 
x 


where C is an arbitrary constant. 

The general solution of the given differential equation is therefore 


1 

xy 


—x + C or y 


1 

Cx — x 2 ' 


15.3.3 EXERCISES 

Use an integrating factor to solve the following differential equations subject to the given 
boundary condition: 

1 . 

where y — 10 when x = 0 


3 d f + 2y = 0, 
dx 
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2 . 


where y — 4 when x — 0. 

3. 


where y — 2 when x — — 1. 

4. 


where y = 0 when x = — 1. 

5. 


where y = § when re = |. 

6 . 


where y = 0 when x = 1. 
7. 


where y = 1 when re = 0. 



10 , 


dy 

di + 


y_ 

X 


3x, 


dre 1 — x 


x 


d y 

- — b y cot x = cos re, 

dre 


/ 2 , t n &y 

[x + 1 ) — xy = x, 

ax 


3y - 2^ = y 3 e to , 

dre 


2y — x 


dy 

dre 


x{;x - 1 )y 4 , 


where y 3 = 14 when x = 1. 



15.3.4 ANSWERS TO EXERCISES 


1 . 


2 . 

3. 

4. 


5. 


6 . 


7. 


y = lOe-t*. 


y = 6ei x -2. 


yx = x 3 — 1. 


y = 2( 1-a; )( 1 + a: ) 2 - 


y = 


sin x 


2 

sin x 


y = 1 + x 1 — ^2(1 + x 2 ). 


7e 3x 
e 7x + 6 


56x 6 

2 lx 6 - 24x 7 + 7' 
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UNIT 15.4 - ORDINARY DIFFERENTIAL EQUATIONS 4 
SECOND ORDER EQUATIONS (A) 

15.4.1 INTRODUCTION 

In the discussion which follows, we shall consider a particular kind of second order ordinary 
differential equation which is called “linear, with constant coefficients”; it has the 
general form 


d2 y , h d y j \ 

a d? + 6 dt +q, = /w ' 

where a, b and c are the constant coefficients. 

The various cases of solution which arise depend on the values of the coefficients, together 
with the type of function, f(x), on the right hand side. These cases will now be dealt with 
in turn. 

15.4.2 SECOND ORDER HOMOGENEOUS EQUATIONS 

The term “homogeneous” , in the context of second order differential equations, is used to 
mean that the function, f(x), on the right hand side is zero. It should not be confused with 
the previous use of this term in the context of first order differential equations. 

We therefore consider equations of the general form 


d 2 y , dy 
«Ty + b-f- + cy 
dx z dx 


0 . 


Note: 

A very simple case of this equation is 


d 2 y_ 

dx 2 


= 0, 


which, on integration twice, gives the general solution 


y = Ax + B, 
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where A and B are arbitrary constants. We should therefore expect two arbitrary constants 
in the solution of any second order linear differential equation with constant coefficients. 

The Standard General Solution 

The equivalent of 


d 2 y , dy 

a ~A + b-f- + cy = 0 
dx z dx 


in the discussion of first order differential equations would have been 


, dy , dy c 

b- b cy = 0; that is, b jy = 0 

da; da; b 


and this could have been solved using an integrating factor of 


e I c t dx = e i x , 


giving the general solution 


y = Ae 


where A is an arbitrary constant. 

It seems reasonable, therefore, to make a trial solution of the form y = Ae mx , where A ^ 0, 
in the second order case. 

We shall need 


dy 

da; 


Ame mx and 


d 2 y 

da; 2 


Am 2 e mx 


Hence, on substituting the trial solution, we require that 


aAm 2 e mx + bArne mx + cAe mx = 0; 
and, by cancelling Ae mx , this condition reduces to 
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am 2 + bm + c = 0, 


a quadratic equation, called the “auxiliary equation”, having the same (constant) coeffi- 
cients as the orginal differential equation. 

In general, it will have two solutions, say m = mj and m = m 2 , giving corresponding 
solutions y = Ae mix and y = Be m2X of the differential equation. 

However, the linearity of the differential equation implies that the sum of any two solutions 
is also a solution, so that 


y = Ae mix + Be m2X 


is another solution; and, since this contains two arbitrary constants, we shall take it to be 
the general solution. 

Notes: 

(i) It may be shown that there are no solutions other than those of the above form though 
special cases are considered later. 

(ii) It will be possible to determine particular values of A and B if an appropriate number 
of boundary conditions for the differential equation are specified. These will usually be a set 
of given values for y and ^ at a certain value of x. 

EXAMPLE 

Determine the general solution of the differential equation 


d 2 y d y 

dx 2 da; 


+ 6 y 


0 


and also the particular solution for which y = 2 and (p = —5 when x = 0. 

Solution 

The auxiliary equation is m 2 + 5 m + 6 = 0, 
which can be factorised as 
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(m + 2 )(m + 3) = 0. 


Its solutions are therefore m = —2 and m = —3. 

Hence, the differential equation has general solution 

y = Ae~ 2x + Be~ 3x : 

where A and B are arbitrary constants. 

Applying the boundary conditions, we shall also need 

d J J = —2Ae~ 2x - 3Be~ 3x . 
ax 

Hence, 


2 = A + B, 

-5 = -2A-3B 

giving A = 1, B = 1 and a particular solution 

y = e~ 2x + e~ 3x . 

15.4.3 SPECIAL CASES OF THE AUXILIARY EQUATION 
(a) The auxilary equation has coincident solutions 

Suppose that both solutions of the auxiliary equation are the same number, m\. 

In other words, the quadratic expression am 2 + bm + c is a “perfect square” , which means 
that it is actually a(m — mi) 2 . 

Apparently, the general solution of the differential equation is 

y = Ae mix + Be mix , 
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which does not genuinely contain two arbitrary constants since it can be rewritten as 


y = Ce mix where C = A + B. 


It will not, therefore, count as the general solution, though the fault seems to lie with the 
constants A and B rather than with m 

Consequently, let us now examine a new trial solution of the form 


y 


ze mix , 


where z denotes a function of x rather than a constant. 
We shall also need 


dy 

da; 


= zmie miX 


+ e 


m\x 


dz 

da; 


and 


pL=zm\e^ + 2m ie ^ + e^AL 
dar da; dar 


On substituting these into the differential equation, we obtain the condition that 


a m ix 


d^ dV 


a zm 1 + 2mi - — h 


d^' 


J 1 J 9 / + & zm i + T~ + 

da; ax z / V da; / 


= 0 


or 


z{am\ + brrii + c) + 


d^ 

da; 


(2ami + b) + a 


d 2 ;? 

da ; 2 


0. 


The first term on the left hand side of this condition is zero since m\ is already a solution 
of the auxiliary equation; and the second term is also zero since the auxiliary equation, 
am 2 + bm + c = 0, is equivalent to a(m — mi) 2 = 0; that is, am 2 — 2amim + am 2 = 0. Thus 
b = —2arrii. 
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We conclude that 4-f = 0 with the result that z = Ax + B, where A and B are arbitrary 
constants. 

The general solution of the differential equation in the case of coincident solutions to the 
auxiliary equation is therefore 


y = (Ax + B)e mix . 


EXAMPLE 

Determine the general solution of the differential equation 


, d 2 y , dy 
4 — - + 4— 
da; 2 da; 


+ y 


0 . 


Solution 

The auxilary equation is 


4m 2 + 4m + 1 = 0 or (2m + l) 2 = 0 


and it has coincident solutions at m — —\. 
The general solution is therefore 


y = (Ax + B)e 2 + 

(b) The auxiliary equation has complex solutions 

If the auxiliary equation has complex solutions, they will automatically appear as a pair of 
“complex conjugates”, say m — a dzj/3. 

Using these two solutions instead of the previous mi and m 2 , the general solution of the 
differential equation will be 


y = Pe {a+j/3)x + Qe {a ~ jl3)x , 

where P and Q are arbitrary constants. 
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But, by properties of complex numbers, a neater form of this result is obtainable as follows: 


y = e aa; [P(cos fix + j sin fix) + Q(cos fix — j sin fix)] 


or 

y = e ox [(P + Q ) cos /3x + j (P — Q ) sin /3a:] . 

Replacing P+Q and j(P—Q) (which are just arbitrary quantities) by A and B, we obtain the 
standard general solution for the case in which the auxiliary equation has complex solutions. 

It is 


y = e ax [A cos fix + B sin fix] . 


EXAMPLE 

Determine the general solution of the differential equation 


d 2 y 

dx 2 


Ay 

6 di 


+ 13 y 


0 . 


Solution 

The auxiliary equation is 


m 2 — 6 m + 13 = 0, 


which has solutions given by 


m = 


-(-6) ± ^(-b) 2 -4x13x1 
2x1 


6 ± j'4 
2 


3±J'2. 
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The general solution is therefore 


y = e 3x [A cos 2 x + B sin 2x \ , 

where A and B are arbitrary constants. 

15.4.4 EXERCISES 


1. Determine the general solutions of the following differential equations: 


(b) 


(c) 


^| + T^ + 12 9 = 0; 

dx z ax 


d 2 r dr 

_ + 6 _ + 9r = 0; 


d' 2 « „ 

dF +4 S + 5 " = 0 - 

2. Solve the following differential equations, subject to the given boundary conditions 
(a) 

d2 y q d v i o - n 

— ^ % — 0 , 

dx z ax 


where y = 2 and 4^ = 1 when x = 0; 


(b) 


d?x , dx 

dr “ 4 * +3l = 0 - 


where x = 3 and qf = 5 when t — 0; 


d 2 z cb 

4— -12- + 9z = 0, 
ds" ds 


where z — 1 and 4^ = | when s = 0; 



(d) 


d 2 r 

d J 2 



+ 2 r 


0, 


where r — 5 and 4^ = 7 when 0 = 0. 


15.4.5 ANSWERS TO EXERCISES 


1. (a) 

y = Ae~ 3x + Be~ 4x ; 

(b) 

r = (A0 + B)e~ 39 ; 

(c) 

6 = e~ 2t [A cos 2t + B sin 2 1] 

2. (a) 

y = 3e x - e 2x ] 

(b) 

x = 2e t A- e 3t ; 

(c) 

2 = (s + l)e5 s ; 

(d) 



r = e e [5 cos 0 + 2 sin 0 . 
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UNIT 15.5 - ORDINARY DIFFERENTIAL EQUATIONS 5 
SECOND ORDER EQUATIONS (B) 

15.5.1 NON-HOMOGENEOUS DIFFERENTIAL EQUATIONS 

The following discussion will examine the solution of the second order linear differential 
equation 


d 2 y ,, d y, n \ 

“d ^ + b T x +cy = nx) ’ 


in which a, b and c are constants, but f(x ) is not identically equal to zero. 

The Particular Integral and Complementary Function 

(i) Suppose that y = u(x ) is any particular solution of the differential equation; that is, it 
contains no arbitrary constants. In the present context, we shall refer to such particular so- 
lutions as “particular integrals” and systematic methods of finding them will be discussed 
later. 

It follows that 


d 2 u d u 

a— — + b - — b cu 
dx z ax 


/(*)• 


(ii) Suppose also that we make the substitution y = u(x) + v(x) in the original differential 
equation to give 


d 2 (u + u) 7 d(« + u) . . . 

a ^ — - + b ; + c(u + v) = f(x). 


da; 2 


da; 


That is, 


and, hence, 


d 2 u du d 2 u dv 

a— — + b— — b cud- a— - + b - — b cv 
dar da; da;^ da; 


f(*)> 


d 2 v dn 

a— r + b - — b cv = 0. 
dx- da; 
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This means that the function v(x) is the general solution of the homogeneous differential 
equation whose auxiliary equation is 


am 2 + bin + c = 0. 

In future, v(x) will be called the “complementary function” in the general solution of 
the original (non-homogeneous) differential equation. It complements the particular integral 
to provide the general solution. 

Summary 


General solution = particular integral + complementary function. 


15.5.2 DETERMINATION OF SIMPLE PARTICULAR INTEGRALS 
(a) Particular integrals, when f(x) is a constant, k. 

For the differential equation 


d 2 y , dy 

+ b-f- + cy 
dx z dx 


k, 


it is easy to see that a particular integral will be y = since its first and second derivatives 
are both zero, while cy = k. 

EXAMPLE 

Determine the general solution of the differential equation 


d 2 y 

dx 2 



+ 10 y 


20 . 


Solution 

(i) By inspection, we may observe that a particular integral is y — 2. 

(ii) The auxiliary equation is 

m 2 + 7m + 10 = 0 or (m + 2)(m + 5) = 0, 
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having solutions m = —2 and m = —5. 
(iii) The complementary function is 


Ae~ 2x + Be~ bx , 

where A and B are arbitrary constants. 

(iv) The general solution is 


y = 2 + Ae" 2 * + Be~ 5x . 

(b) Particular integrals, when f(x) is of the form px + q. 

For the differential equation 


d 2 y , dy 

a TA + b A L + c y 

ax z ax 


px + q, 


it is possible to determine a particular integral by assuming one which has the same form 
as the right hand side; that is, in this case, another expression consisting of a multiple of x 
and constant term. The method is, again, illustrated by an example. 

EXAMPLE 

Determine the general solution of the differential equation 


d 2 y 

dx 2 



+ 28 y 


84r - 5. 


Solution 

(i) First, we assume a particular integral of the form 


y = ax + /?, 


which implies that ^ = a and = 0. 

r ax ax z 
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Substituting into the differential equation, we require that 


— 11a + 28(aa; + (5) = 84a; — 5. 

Hence, 28a = 84 and —11a + 28 (3 = —5, giving a = 3 and (5 = 1. 
Thus, the particular integral is 


y = 3x + 1. 

(ii) The auxiliary equation is 

m 2 — 11 m + 28 = 0 or (m — 4 )(m — 7) = 0, 

having solutions m = 4 and m = 7. 

(iii) The complementary function is 

Ae 4x + Be 7x , 

where A and B are arbitrary constants. 

(iv) The general solution is 


y = 3x + l + He 4x + Be 7x . 


Note: 

In examples of the above types, the complementary function must not be prefixed by u y =” , 
since the given differential equation, as a whole, is not normally satisfied by the complemen- 
tary function alone. 
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15.5.3 EXERCISES 


1. Determine the general solutions of the following differential equations: 
(a) 

d 2 y d y 


dx 2 dx 


2 y = 6; 


(b) 


d 2 y 
dx 2 


+ 16 y = 7; 


(d) 


d 2 y dy 

3 t4 — y = x + i-, 

dx z dx 


pi 6 p + S)y = Kx + 

dx z dx 


2. Solve, completely, the following differential equations, subject to the given boundary 
condidtions: 


2^ -7^ -4, = 100, 
dx z dx 


where y = —26 and 4 s = 5 when x = 0; 


(b) 


^ + 4^ + 4, = 12* + 16, 
dx z dx 


where y — 0 and 4v = 4 when x = 0 


dx 


(c) 


- 6^r + 10y = 10x + 14, 
dx z dx 


where y = 3 and Pi = 2 when x = 0 


dx 
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15.5.4 ANSWERS TO EXERCISES 


1. (a) 

(b) 

(c) 

(d) 


y = -S + Ae x + Be 2x ; 

7 

y = 1- A cos 4 x + B sin 4x; 

16 

y — 1 — x + Ae x + Be~^ x ] 
y = 2x + 5 + (Ax + B)e 3x . 


2. (a) 

(b) 

(c) 


y = -25 + e 4x - 2e2 x ; 
y = 3a: + 1 — (a; + l)e _2a; ; 
y = x + 2 + e 3a: (cosa; — 2 sinx). 
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UNIT 15.6 - ORDINARY DIFFERENTIAL EQUATIONS 6 
SECOND ORDER EQUATIONS (C) 

15.6.1 RECAP 

For the differential equation 


d 2 V , , dy , . 

+ + ^ = /W ’ 


it was seen, in Unit 15.5, that 

(a) when f(x) = k, a given constant, a particular integral is y — 

(b) when /(a;) = px + q, a linear function in which p and g are given constants, it is possible 
to obtain a particular integral by assuming that y also has the form of a linear function; that 
is, we make a “trial solution” , y — ax + (3. 

15.6.2 FURTHER TYPES OF PARTICULAR INTEGRAL 

We now examine particular integrals for other cases of f(x), the method being illustrated 
by examples. Also, for reasons relating to certain problematic cases discussed in Unit 15.7, 
we shall determine the complementary function before determining the particular integral. 


1 . f(x) = px 2 + qx + r, a quadratic function in which p, q and r are given constants; 
P ± o. 


Trial solution : y = ax 2 + f3x + 7 . 


Note: 

This is the trial solution even if q or r (or both) are zero. 

EXAMPLE 

Determine the general solution of the differential equation 

2 dy _ 7 <^_ Ay = 4x 2 + 1Qx _ 23 


Solution 

The auxilary equation is 


2 m 2 — 7m — 4 = 0 or (2m + l)(m — 4) = 0, 


1 



having solutions rn — 4 and m— — 

Thus, the complementary function is 

He 4x + Be~* x , 

where A and B are arbitrary constants. 

To determine a particular integral, we may make a trial solution of the form 
y = ax 2 + (3x + 7 , giving ^ = 2ax + (3 and = 2a. 

We thus require that 

4 a — 14aa; — 7 (3 — 4 ax 2 — 4/Tr — 4y = 4a; 2 + 10a; — 23. 


That is, 


— 4aa ; 2 — (14a + 4/5) x + 4a — 7/3 — 4y = 4a ; 2 + 10a; — 23. 

Comparing corresponding coefficents on both sides, this means that 

—4a = 4, — (14a + 4/3 ) = 10 and 4a — 7/3 — 4y = —23, 

which give a = —1, (3 — 1 and 7 = 3. 

Hence, the particular integral is 


y = 3 + x — x 2 . 

Finally, the general solution is 

y = 3 + x — x 2 + He 4x + Be~* x . 

2. f(x) = p sin kx + qcoskx, a trigonometric function in which p, q and k are given 
constants. 


Trial solution : y — a sin kx + (3 cos kx. 


Note: 

This is the trial solution even if p or q is zero. 
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EXAMPLE 

Determine the general solution of the differential equation 


d 2 y Ay 
ix^~ 2 ix +2v 


8 cos 3a; — 19 sin 3a;. 


Solution 

The auxiliary equation is 


m 2 — 2m + 2 = 0, 

which has complex number solutions given by 

2±v/4^8 1-L . 

m = = 1 + 7 . 

2 J 

Hence, the complementary function is 

e x ( A cos x + B sin x ) , 
where A and B are arbitrary constants. 

To determine a particular integral, we may make a trial solution of the form 

y = a sin 3 x + (3 cos 3a;, 

giving ^ = 3 a cos 3a; — 3/3 sin 3a; and 4-4( = —9a sin 3a; — 9 f3 cos 3a;. 

We thus require that 

—9a sin 3a; — 9 (3 cos 3a; — 6a cos 3a; + 6 (3 sin 3a; + 2a sin 3a; + 2 (3 cos 3x = 8 cos 3a; — 19 sin 3a;. 
That is, 


(—9a + 6 (3 + 2a) sin 3a; + (—9(3 — 6a + 2(3) cos 3x = 8 cos 3a; — 19 sin 3a;. 
Comparing corresponding coefficients on both sides, we have 


—7a + 6/3 = —19, 
-6a -7(3 = 8. 
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These equations are satisfied by a = 1 and (3 = —2, so that the particular integral is 

y = sin 3x — 2 cos 3a;. 


Finally, the general solution is 


y = sin 3a; — 2 cos 3a; + e x (A cos x + B sin x). 

3. f(x) = pe kx , an exponential function in which p and k are given constants. 


Trial solution : y = ae kx . 


EXAMPLE 

Determine the general solution of the differential equation 


Solution 

The auxiliary equation is 


9 fy 

Ax 2 



+ y 


50e 3x . 


9 m 2 + 6 m + 1 = 0 or (3m + l) 2 = 0, 

which has coincident solutions at m — — |. 

The complementary function is therefore 

(Ax + B)e~^ x . 

To find a particular integral, we may make a trial solution of the form 

y = ae 3x , 

which gives ^ = 3 ae 3x and = 9 ae 3x . 

Hence, on substituting into the differential equation, it is necessary that 


&lae 3x + l&ae 3x + ae 31 = 50e 3x . 

That is, lOOo; = 50, from which we deduce that a = \ and a particular integral is 

1 


y = -e 
y 2 


3a; 


Finally, the general solution is 


y = -e 3x + (Ax + B)e~^ x . 


4 



4 . f(x) = p sinh kx + qcoshkx , a hyperbolic function in which p, q and k are given 
constants. 


Trial solution : y = a sinh kx + (3 cosh kx. 


Note: 

This is the trial solution even if p or q is zero. 

EXAMPLE 

Determine the general solution of the differential equation 

— 5^ + 6y = 93 cosh 5a; — 75 sinh 5x. 
daH da; 

Solution 

The auxiliary equation is 


m 2 — 5 m + 6 = 0 or (m — 2 )(m — 3) = 0, 
which has solutions m — 2 and m = 3 so that the complementary function is 


Ae 2 * + 5e 3a; , 

where A and i? are arbitrary constants. 

To determine a particular integral, we may make a trial solution of the form 


y = a sinh 5 x + /3 cosh 5a;, 

giving ^ = 5a cosh 5a; + 5(3 sinh 5a; and = 25a sinh 5a; + 25/3 cosh 5a;. 
Substituting into the differential equation, the left-hand-side becomes 


25a sinh 5a; + 25/3 cosh 5a; — 25a cosh 5a; — 25/3 sinh 5a; + 6a sinh 5a; + 6/3 cosh 5a;. 
This simplifies to 


(31a — 25/3) sinh 5a; + (31 (3 — 25a) cosh 5a;, 

so that we require 


31a - 25/3 = -75, 
-25a + 31/3 = 93, 

5 



and these are satisfied by a = 0 and (3 = 3. 
The particular integral is thus 


y = 3 cosh 5a; 


and, hence, the general solution is 

y — 3 cosh 5a; + Ae 2x + Be Zx . 

5. Combinations of Different Types of Function 

In cases where f(x) is the sum of two or more of the various types of function discussed 
previously, then the particular integrals for each type (determined separately) may be 
added together to give an overall particular integral. 

15.6.3 EXERCISES 


1. Determine the general solution for each of the following differential equations: 


(a) 


(b) 


(c) 


(d) 


d 2 y _ dy 

d^ + 5 di 


+ 4 y 


4a; 2 + 2x — 4; 


d 2 y , dy 

— j — — + 4- — h 5y = 8 cos 2x — sin 2a;; 
da; 2 da; 


4 I!| + 1 2^ + 9y = 27e- 

da;- da; 


d 2 y „ dy 
d^ - 3 ch~ V 


cosh 3a; — sinh 3a;. 


6 



2. Solve completely the following differential equations subject to the given boundary 
conditions: 


d 2 y 

— r — y = 10 — 5re 2 — x + 16e~ 3x , 
dx 2 


where y — 13 and 4^ = —2 when x = 0; 


(b) 


d 2 y dv 

4 — - + 8 3y = 9x + 6 cosx — 19 sin re, 

dx 2 dre 


where y = —2 and = 0 when re = 0. 


do: 


15.6.4 ANSWERS TO EXERCISES 


1. (a) 

(b) 

(c) 

(d) 

2. (a) 

(b) 


y = x 2 - 2rc + 1 + + Be ~ Ax ; 


y — sin2re + e x (A cos x + B sin x); 


y = 3e x + (Ax + B)e 2= 


y = - (cosh 3x — sinh 3rc) + Ae— 2x + Re 


y = 5rc 2 + re + 2e 3j:: + 3e x — 2e x ; 


5a: 


y = 3rc — 8 + 2 cos re + sin re + 2e + 2e ^ 
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UNIT 15.7 - ORDINARY DIFFERENTIAL EQUATIONS 7 
SECOND ORDER EQUATIONS (D) 

15.7.1 PROBLEMATIC CASES OF PARTICULAR INTEGRALS 

Difficulties can arise if all or part of any trial solution would already be included in the 
complementary function. We illustrate with some examples: 

EXAMPLES 


1. Determine the complementary function and a particular integral for the differential 
equation 


d 2 y_ 
dx 2 


odf/ 

da; 


+ 2 y = e 2x . 


Solution 

The auxiliary equation is m 2 — 3 m + 2 = 0, with solutions m — 1 and m — 2 and hence 
the complementary function is Ae x + Be 2x , where A and B are arbitrary constants. 

A trial solution of y = ae 2x gives 


d y 

da; 


2 ae 2x and 


d 2 y 
dx 2 


4ae 2x 


and, on substituting these into the differential equation, it is necessary that 


4ae 2x - 6ae 2x + 2oe 2a: = e 2 + 

That is, 0 = e 2x which is impossible. 

However, if y — ae 2x has proved to be unsatisfactory, let us investigate, as an alternative, 
y = F(x)e 2x (where F(x) is a function of x instead of a constant). 

We have 

^ = 2 F(x)e 2x + F'(x)e 2x 
dx 

and, hence, 


d 2 y 
dx 2 


4 F(x)e 2x + 2 F\x)e 2x + F"(x)e 2x + 2 F'(x)e 2x . 


1 



On substituting these into the differential equation, it is necessary that 

(4 F(x) + 2 F\x) + F"(x) + 2 F\x) - 6 F(x) - 3 F'(x) + 2 F(x)) e 2x = e 2x . 
That is, 


F"(x) + F\x) = 1, 

which is satisfied by the function F{x) = x and thus a suitable particular integral is 

9 r 

y = xe . 


Note: 

It may be shown, in other cases too that, if the standard trial solution is already 
contained in the complementary function, then it is is necessary to multiply it by x in 
order to obtain a suitable particular integral. 


2. Determine the complementary function and a particular integral for the differential 
equation 


d 2 y 

—— + y = sin x. 
dx z 


Solution 

The auxilary equation is m 2 + 1 = 0, with solutions m = ±j and, hence, the comple- 
mentary function is A sin x + B cos a;, where A and B are arbitrary constants. 

A trial solution of y — a sin x + (3 cos x gives 


d 2 _y 
dx 2 


— a sin a; — (3 cosx ; 


and, on substituting into the differential equation, it is necessary that 0 = sin x, which 
is impossible. 

Here, we may try y = x(o;sinx + (3 cosx), giving 


— = a sin x + (3 cos x + x(a cos x — (3 sin x) 
dx 


(i a — (3x) sin a; + ((3 + ax) cosx 


and, therefore, 


d 2 y 
dx 2 


(i a—(3x ) cos x — (3 sin x — (/3 + ax) sinx + ocosx = (2 a — (3x) cosx— (2/3 + ax) sinx. 
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Substituting into the differential equation, we thus require that 


(2 a — (3x) cos x — ( 2(3 + ax) sin x + x(a sin x + (3 cos x) = sin x, 
which simplifies to 


2 a cos x — 2(3 sin x = sin x. 


Thus 2a = 0 and —2(3 = 1. 

An appropriate particular integral is now 


3. Determine the complementary function and a particular integral for the differential 
equation 

2 y , Ay 


d 2 " 


9 — - + 6 
da; 2 da; 


+ y == 50e" 


Solution 

The auxiliary equation is 9 m 2 + 6 m + 1 = 0, or (3m + l) 2 = 0, which has coincident 
solutions m — — | and so the complementary function is 


(. Ax + B)e 3 X . 

In this example, both e~^ x and xe~^ x are contained in the complementary function. 
Thus, in the trial solution, it is necessary to multiply by a further x, giving 


y 


2 — -r 

ax e s . 


We have 


and 


d y 

da; 


2aa;e 


-x 2 es x 

3 


Cl IJ —-'T ^ —-T " — T -1 2 —-T 

; = 2 ae a axe 3 axe $ + -ax e 3 . 

da; 2 3 3 9 
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Substituting these into the differential equation, it is necessary that 

(l8a — 12 ax + ax 2 + 12 ax — ‘lax 2 + ax 2 ') e~^ x = SOe - ^ 

and, hence, 18a = 50 or a = y. 

An appropriate particular integral is 


^ 2 — -x 

y = — are 3 . 
y 9 

4. Determine the complementary function and a particular integral for the differential 
equation 

P^-5p- + 6y = smh2x. 

dx 2 da; 

Solution 

The auxiliary equation is m 2 — 5 m + 6 = 0 or (m — 2 )(m — 3) = 0 which has solutions 
m — 2 and m — 3 and, hence, the complementary function is 

Ae 2x + Be 3x . 

However, since sinh2a; = |(e 2a: — e~ 2x ), part of it is contained in the complementary 
function and we must find a particular integral for each part separately. 

(a) For ^ e 2x , we may try 


2 r 

y = xae , 


giving 


and 


^ = ae 2x + 2 xae 2x 
dx 


d 2 v 

= 2ae 2x + 2ae 2x + 4 xae 2x . 

dx 2 


Substituting these into the differential equation, it is necessary that 


(4a + 4a;a — 5a — ICtea + 6xa) e 2x = 



5 


which gives a = — 


4 



(b) For —\e 2x , we may try 


V = Pe~ 2x , 


giving 


and 


d y 

da; 


-2pe~ 2x 


d 2 y 

dx 2 


Af3e~ 2x . 


Substituting these into the differential equation, it is necessary that 


(4/3 + 10/5 + 6(3)e~ 2x = -\ e ~ 2x , 


which gives (3 = 

The overall particular integral is thus 




15.7.2 EXERCISES 

Solve completely the following differential equations subject to the given boundary condi- 
tions: 


1 . 


dx z dx 


where y = 0 and ^ — | when x = 0. 


2. 


— + 9y = 2 sin 3a;, 
da; 2 


where y = 2 and ^ — | when x — 0. 


5 



3 . 


- 10^ + 25y = 8e 3x + 25a; 2 - 20a; + 27, 
dar da; 


where y = 5 and ^ = 13 when x — 0. 


dx 


d 2 y dy 

— - 3-f- + 2 y = cosh a;, 
dar da; 


where y = A and ^ = * when x = 0. 


12 ^ ^ dx 2 


i pL + 4 d / + y = 24e-i* 

dx A dx 


where y = 6 and xz = 2 when x = 0. 


dx 


15.7.3 ANSWERS TO EXERCISES 

1 . 


2 . 


y = -xe x + Ae x + Be 3a: . 


y = — -x cos 3a; + 2 cos 3a; + sin 3a;. 
o 


3. 


4. 


5. 


y = 2e 3x + a; 2 + 1 + (2 - 3x)e 5x . 
V=Y^ {e~ x ~ dxe x -e x + 7e 2x ) . 
y = 3x 2 e~^ x + (5a; + 6)e - ^. 
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UNIT 15.8 - ORDINARY DIFFERENTIAL EQUATIONS 8 
SIMULTANEOUS EQUATIONS (A) 

15.8.1 THE SUBSTITUTION METHOD 

The methods discussed in previous Units for the solution of second order ordinary linear 
differential equations with constant coefficients may now be used for cases of two first or- 
der differential equations which must be satisfied simultaneously. The technique will be 
illustrated by the following examples: 

EXAMPLES 

1. Determine the general solutions for y and z in the case when 


Av 


-2— +±y-z 
ax 


d y 

da; 


+ 8y — 3z 



Solution 

First, we eliminate one of the dependent variables from the two equations; in this case, 
we eliminate z. 

From equation (2), 




+ 8y — 5e 



and, on substituting this into equation (1), we obtain 


Ay 


|| +8 ^ + 5 e -d +4 l(A +8!/ 

dar da; / 3 V da; 


be”* = 


= e 


That is, 


2 d 2 y 2 dy 4 8 T 

- 4 — v = -e 

3 da; 2 3 da; 6 3 


or 


d 2 y 

da; 2 


dy 

da; 


-2 y 


-ie~ x . 


1 



The auxiliary equation is 


m 2 + m — 2 = 0 or (m — l)(m + 2) = 0, 

giving a complementary function of Ae x +Be~ 2x , where A and B are arbitrary constants. 
A particular integral will be of the form ke~ x , where k — k — 2k = —4 and hence k — 2. 
Thus, 


y = 2e~ x + Ae* + 5e“ 23: . 

Finally, from the formula for z in terms of y, 

z = ^ (~2e~ x + Ae* - 2 Be~ 2x + 16e~ x + 8Ae* + 8 Be~ 2x - 5e~ x ) . 

That is, 


z = 3e~ x + 3Ae~ x + 2 Be~ 2x . 


Note: 

The above example would have been a little more difficult if the second differential 
equation had contained a term in But, if this were the case, we could eliminate 
4^ between the two equations in order to obtain a statement with the same form as 
Equation (2). 

2. Solve, simultaneously, the differential equations 


( 1 ) 
( 2 ) 

given that y — 1 and z — 2 when x — 0. 

Solution: 

From equation (2), we have 


d^ 

v F 2y 

da; 


= e 


d y 

da; 


— 2z = 1 + x, 
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Substituting into the first differential equation gives 


1 f d 2 y 

2 dx 2 


+ 2 y = e x 


or 

d 2 v 

-JL + Ay = 2e x + 1. 
dx z 

The auxiliary equation is therefore m 2 + 4 = 0, having solutions m = ±j 2, which means 
that the complementary function is 


A cos 2 x + B sin 2.r, 

where A and B are arbitrary constants. 

The particular integral will be of the form y = pe x + q, 
where 


pe x + Ape x + 4 q = 2e x + 1. 

We require, then, that 5p = 2 and Aq — 1; and so the general solution for y is 

2 1 

y — A cos 2x + B sin 2x + -e x + 

5 4 


Using the earlier formula for z, we obtain 


1 

Z ~ 2 


-2 A sin 2x + 2 B cos 2x H — e x — 1 — x 

5 


= B cos 2 x — A sin 2x A — e x 


Applying the boundary conditions, 


2 1 


1 = A H 1 — giving A = — 

5 4 s 20 


and 


1 1 


23 


2 = B + - - - giving B = — . 


5 2 


10 


The required solutions are therefore 


and 


7 23 2 T 1 

V = 20 C ° S 2X + 10 SU1 2X + 5 ' 6 ' + 4 


23 7 1 r 1 x 

z = — cos 2x sm 2x 4 — e . 

10 20 5 2 2 


1 

2 


x 

2 ' 


3 



15.8.2 EXERCISES 


Solve the following pairs of simultaneous differential equations, subject to the given boundary 
conditions: 


1. 


^ + 2, = e- 

da; 

d^ 

— + 3^ = y, 
ax 


given that y — 1 and z — 0 when x — 0. 


d y 

da; 

d^ 

da; 


— z = sin X , 


+ y = cos a;, 


given that y — 3 and z — 4 when x = 0. 


d V a o q 
- + 2y-3z 

ax 


= 1, 


d£ 

da; 


y = e 


—2x 


given that y — 0 and z — 0 when x = 0. 


4. 


d y 

da; 

d^ 

da; 


2z-i 
8 Vi 


given that y — 1 and z — 0 when x = 0. 
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5. 


dy , dz 

— — b 4— — b 6z = 0, 
da; da; 

S^+2^+6, = 0, 

da; da; 


given that y = 3 and z = 0 when x = 0. 

Hint: First eliminate the 4 1 terms to obtain a formula for z in terms of y and 4^. 

ax. & ax 


6 . 

10 ^- 3 F + % + 5, = 0, 

da; da; 
dy dz 

2-p-y- + 2y + z = 2e , 
da; da; 

given that y = 2 and z = —1 when x — 0. 

Hint: First, eliminate the da an d z terms in one step, to obtain a formula for y in 
terms of ^ and x. 

ax 

15.8.3 ANSWERS TO EXERCISES 


1 . 


y = (2a; + l)e x and z = xe x . 
y = (x + 4) sin x + 3 cos x and z = (x + 4) cos x — 3 sin x. 


4. 


5. 


6 . 


y=-e x + -e -3 * - e“ 2x and z = -e x - 1 e~ 3x - -. 
y 2 2 2 6 3 


y = ^e 4x — 4x = sinh4a; and z = e 4x + e 4x = 2 cosh Ax. 


y = 2e x + e 2a: and z = e x — e 2x . 
y = sin a; + 2e _a; and z = e -a; — 2 cos a;. 
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UNIT 15.9 - ORDINARY DIFFERENTIAL EQUATIONS 9 
SIMULTANEOUS EQUATIONS (B) 

15.9.1 INTRODUCTION 

For students who have studied the principles of eigenvalues and eigenvectors (see Unit 9.6), a 
second method of solving two simultaneous linear differential equations is to interpret them 
as a single equation using matrix notation. The discussion will be limited to the simpler 
kinds of example, and we shall find it convenient to use t, X\ and X2 rather than x , y and z. 

15.9.2 MATRIX METHODS FOR HOMOGENEOUS SYSTEMS 

To introduce the technique, we begin by considering two simultaneous differential equations 
of the form 


“77“ = ax i + bx 2 , 
at 

= CX 1 + dx 2- 
at 


which are of the “homogeneous” type, since no functions of t, other than x\ and X2, appear 
on the right hand sides. 

(i) First, we write the differential equations in matrix form as 


'Xi~ 


'a 

b~ 


'Xi 

A'2_ 


c 

d_ 


X2_ 


which may be interpreted as 


— = MX where X 
dt 


X\ 

x 2 \ ’ 


and M 


a b ' 
c d ' 


(ii) Secondly, in a similar way to the method appropriate to a single differential equation, 
we make a trial solution of the form 


X = Ke At , 


1 



where 


is a constant matrix of order 2x1. 

This requires that 

XKe xt = MKe At or AK = MK, 

which we may recognise as the condition that A is an eigenvalue of the matrix M, and K is 
an eigenvector of M. 

The solutions for A are obtained from the “characteristic equation” 


In other words, 


|M — AI| = 0. 


a — A b 
c b - A 


0, 


leading to a quadratic equation having real and distinct solutions (A = Ai and A = A 2 ), real 
and coincident solutions (A only) or conjugate complex solutions (A = l ± jrn) . 

(iii) The possibilities for the matrix K are obtained by solving the homogeneous linear equa- 
tions 


(a. — \\k\ T &/c 2 — 0, 
cli' i -f- (d — Ai)/c 2 — 0, 


giving k\ : & 2 = 1 : a (say) 
and 


2 



0, 

0, 


(a - X 2 )ki + bk 2 = 
ck j + (d — X 2 )k 2 = 


giving ki : k 2 = 1 : (3 (say). 

Finally, it may be shown that, according to the types of solution to the auxiliary equation, 
the solution of the differential equation will take one of the following three forms, in which 
A and B are arbitrary constants: 

(a) 


A 


■ r 

a 


3 Ai t 


+ B 


1 ' 

J. 


3 A2 1 . 


(b) 


( At + B ) 


A 

+ 6 


„At. 


or 

c 


A 

pA + qB 


cos mt + 


B 

pB — qA J 


sin mt 


where, in (c), 1 : a = 1 : p + jq and 1 : /3 — 1 : p — jq. 

EXAMPLES 


1. Determine the general solution of the simultaneous differential equations 


(\x i 

dt 

dx 2 

dt 


-4aq + 5x 2 , 
—xi + 2x 2 . 


3 



Solution 

The characteristic equation is 


That is, 


-4- A 5 
-1 2 - A 


0 . 


A 2 + 2A — 3 = 0 or (A - 1)(A + 3) = 0. 
When A = 1, we need to solve the homogeneous equations 


—bk\ + bk 2 = 0, 
—k\ + k 2 = 0, 


both of which give k\ : k 2 — 1 : 1. 

When A = — 3, we need to solve the homogeneous equations 


— k\ + hk 2 — 0, 
k\ + 5 k 2 = 0, 


both of which give ki : k- 2 = 1 : 
The general solution is therefore 


Xi 

= A 

T 

e* + B 

'V 

1 

X 2 . 


.1. 


- 5 - 


or, alternatively, 


X\ 

_x 2 _ 


A 


+ B 


3t 


where A and B are arbitrary constants. 
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2. Determine the general solution of the simultaneous differential equations 


dx i 
dt 

dx 2 

dt 


Xi - x 2 , 
xi + 3x 2 . 


Solution 

The characteristic equation is 


1 - A -1 
1 3- A 


= 0. 


That is, 

A 2 — 4A + 4 = 0 or (A - 2) 2 = 0. 
When A = 2, we need to solve the homogeneous equations 


-ki - k 2 = 0, 

k\ + k 2 = 0, 


both of which give k\ : k 2 — 1 : — 1. 
The general solution is therefore 


Xi 

x 2 _ 





2 1 


5 


where A and B are arbitrary constants. 

3. Determine the general solution of the simultaneous differential equations 


dx i 

dt 

dx 2 

dt 


x\ - 5x 2 , 


2xi + 3x 2 . 
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Solution 

The characteristic equation is 


1 - A -5 
2 3- A 


0. 


That is, 


A 2 -4A + 13 = 0, 

which gives A = 2 ± j3. 

When A = 2 + j 3, we need to solve the homogeneous equations 


(-l-j3)k 1 -5 k 2 = 0, 
2Aq + (1 — j3)/c 2 = 0, 


both of which give k\ : k 2 — 1 : 1 5 • / ’ 3 . 

When A = 2 — j 3, we need to solve the homogeneous equations 


(—l+j3)ki — 5k 2 = 0, 

2k\ + (1 + j3)k 2 = 0, 


both of which give k\ : k 2 — 1 : 
The general solution is therefore 


t 


A 

-A + 3B 


cos 3f + 


B 

-B-3A 


sin 3 1 


where A and B are arbitrary constants. 

Note: 

From any set of simultaneous differential equations of the form 


d:r i Ax 2 

a— b b— b cx i + ax 2 

at at 


/ d.x' 1 r Ax 2 , , 

a -r- + 0 — — + 0 X\ + C X2 


dt 


dt 


0, 

0, 
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it is possible to eliminate and ^ in turn, in order to obtain two equivalent equations 
of the form discussed in the above examples. 

15.9.3 EXERCISES 

1. Determine the general solution of the simultaneous differential equations 

= x 1 + 3x 2 , 

= 3x\ + x 2 . 

2. Solve completely, the simultaneous differential equations 

3a;i + 2 x 2 , 

4xi + x 2 , 

given that x\ = 3 and x 2 = —3 when t — 0. 

3. Solve completely, the simultaneous differential equations 

X\ + 9 x 2 , 

llxi + x 2 , 

given that X\ = 20 and x 2 = 20 when t — 0. 

4. Solve completely, the simultaneous differential equations 

= 0 , 

= 0 , 

given that X\ — 2 and x 2 = 0 when t — 0. 


UXl 

dt 

dxi 


(XX 2 

dt 


+ 2.r | — 2x 2 


x- + 2 xi- 7x ‘- 5 ^ 


dx i 

dt 

dx 2 

dt 


dx i 

dt 

dx 2 

dt 


dx i 

dt 

dx 2 

dt 


7 



5. Determine the general solution of the simultaneous differential equations 


dx i 

d t 
dx 2 

dt 


5xi + 2x 2 , 
—2xi + x 2 . 


6. Determine the general solution of the simultaneous differential equations 


dx i 

dt 

dx 2 

dt 


8xi + x 2 , 

— 5aq + 6 x 2 . 


15.9.4 ANSWERS TO EXERCISES 
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UNIT 15.10 - ORDINARY DIFFERENTIAL EQUATIONS 10 
SIMULTANEOUS EQUATIONS (C) 

15.10.1 MATRIX METHODS FOR NON-HOMOGENEOUS SYSTEMS 


In Units 15.5, 15.6 and 15.7 , it was seen that, for a single linear differential equation with 
constant coefficients, the general solution is made up of a particular integral and a comple- 
mentary function (the latter being the general solution of the corresponding homogeneous 
differential equation). 

In the work which follows, a similar principle is applied to a pair of simultaneous non- 
homogeneous differential equations of the form 


dx i 

d t 
dx 2 

dt 


ax i + bx 2 + f{t), 
cx i + dx 2 + g(t). 


The method will be illustrated by the following example, in which f(t) = 0: 

EXAMPLE 

Determine the general solution of the simultaneous differential equations 


dxi 

dt 

dx 2 

dt 


x 2 > ( 1 ) 

-4x! — 5x 2 + g(t), (2) 


where g{t ) is (a) t, (b) e 2t (c) sint, (d) e~ l . 

Solutions 

(i) First, we write the differential equations in matrix form as 


d rxi' 
dt [ x 2 _ 


' 0 1 ' 

-4 -5 



'O' 

1 


9{t)> 


1 



which may be interpreted as 


dX 

dt 


MX + N g(t) where X 





and N 


'0 

1 


(ii) Secondly, we consider the corresponding “homogeneous” system 


dX 

dt 


MX, 


for which the characteristic equation is 


0 - A 1 
-4 -5 - A 


0, 


and gives 


A(5 + A) + 4 = 0 or A 2 + 5A + 4 = 0 or (A + 1)(A + 4) = 0. 


(iii) The eigenvectors of M are obtained from the homogeneous equations 


— A k\ + /c 2 — 0, 
— 4£q — (5 + A)/c 2 = 0. 


Hence, in the case when A = —1, we solve 


k\ T &2 — 0, 

— 4/ci — 4 /c 2 = 0, 


and these are satisfied by any two numbers in the ratio /q : & 2 = 1 : — 1. 
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Also, when A = —4, we solve 


Ak\ + — 0, 

—4 ki — k 2 = 0 

which are satisfied by any two numbers in the ratio k\ : k 2 = 1 : —4. 

The complementary function may now be written in the form 

where A and B are arbitrary constants. 

(iv) In order to obtain a particular integral for the equation 

dX 

— = M X + Ng(t), 

we note the second term on the right hand side and investigate a trial solution of a similar 
form. The three cases in this example are as follows: 

(a) g(t) = t 

Trial solution X = P + Q t, 

where P and Q are constant matrices of order 2x1. 

We require that 

Q = M(P + Q t) + N t, 

whereupon, equating the matrix coefficients of t and the constant matrices, 

MQ + N = 0 and Q = MP, 


-t 


+ B 


1 

-4 


-it 
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giving 


Thus, using 


Q = — M -1 N and P = M -1 Q. 


M _1 


i r —5 -r 
4 [ 4 0 J ’ 


we obtain 


and 


1 

r-s -ii 


'O' 


'0.25' 

4 

o 

i 


.1. 


0 


'-5 

-1' 


'0.25' 


'-0.3125' 

. 4 

0 . 


0 


0.25 


The general solution, in this case, is 


X = A 


' 1 ' 

f 7 — v 

' 1 ' 

—At , 

'-0.3125' 


.-1. 

e * + B 

.-4. 

e 4t + 

0.25 

+ 


0.25 

0 


(b) git) = e 2t 


Trial solution X = Pe 2t 


We require that 


2Pe 2t = MPe 2t + Ne 2t . 


That is, 


2P = MP + N. 
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The matrix, P, may now be determined from the formula 


(21 — M)P = N; 


or, in more detail, 


'2 

4 



.P = N. 


Hence, 


1 

■ 7 r 


'1' 

1 

' 7 ' 

18 

.-4 2_ 


. 0 . 

_ 18 

.-4. 


The general solution, in this case, is 


X = A 

' 1 ' 

e“* + B 

' 1 ' 

e“ 4t + - 

' 7 ' 


.- 1 . 


.-4. 

18 

.-4. 


2 1 


(c) g(t) = sin t 


Trial solution X = P sin t + Q cos t. 


We require that 


P cos t — Q sin t = M(P sin t + Q cos t) + N sin t. 


Equating the matrix coefficients of cos t and sin t, 


P = MQ and - Q = MP + N, 


which means that 


-Q = M 2 Q + N or (M 2 + I)Q = -N. 
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Thus, 


Q = — (M 2 + I) _1 N, 


where 


M 2 + I 


' 0 

1 ' 


' 0 

1 ' 

+ 

'1 

O' 

.-4 

—5 


.-4 

—5 

.0 

1 . 


'-3 -5' 
20 22 


and, hence, 


Also, 


1 

'22 5 ' 


'O' 

1 

'-5' 

34 

.-20 -3. 


.1. 

“ 34 

. 3 . 


P = MQ 


1 

r o li 


'-5' 

1 

'3' 

34 

LQ 

1 

1 

1 


. 3 . 

“ 34 

.5, 


The general solution, in this case, is 


X = A 

' 1 ' 

e -t + B 

' 1 ' 

e~ 4t + - 

'3' 

1 

sin t +;ifefE 

'-5' 




.-4. 

34 

.5. 

34 

3 


(d) g{t) = e * 

In this case, the function, g(t), is already included in the complementary function and it 
becomes necessary to assume a particular integral of the form 


X = (P + Qf)e _t , 


where P and Q are constant matrices of order 2x1. 
We require that 


Qe“* - (P + Qt)e _t = M(P + Qt)e _t + Ne _t , 
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whereupon, equating the matrix coefficients of te * and e f , we obtain 


-Q = MQ and Q - P = MP + N. 

The first of these conditions shows that Q is an eigenvector of the matrix M corresponding 
the eigenvalue —1 and so, from earlier work, 


Q — k 


for any constant k. 
Also, 


(M + I)P = Q-N; 


or, in more detail, 

O' 
1 . 

Hence, 


Pi + P-2 = k, 

—4 pi — 4 p-2 = —k — 1. 



Using pi + p 2 = k and pi + p 2 = we deduce that k — and that the matrix P is given 
by 


P 


r i i 

i 

'O' 

+ / 

' 1 ' 


“ 3 

.1. 

,-l. 


for any number, l. 
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Taking / = 0 for simplicity, a particular integral is therefore 


3 


+ 


1 

-1 


t> e 


-t 


and the general solution is 


X = A 


1 

-1 


e~ f + B 


1 

-4 


e~ 4t + 


+ 


1 

-1 


t> e 


-t 


Note: 

In examples for which neither f(t) nor g(t) is identically equal to zero, the particular integral 
may be found by adding together the separate forms of particular integral for f(t) and g(t) 
and writing the system of differential equations in the form 

dX 

— = MX + N 1 /(t) + N 2 s(t), 
dt 


where 


Ni = 


T' 

0 


and N 9 = 


'O' 

1 


For instance, if f(t) = t and g(t) = e 2t , the particular integral would take the form 


X = P + Qt + Re 2t , 


where P, Q and R are matrices of order 2x1. 


15.10.2 EXERCISES 

1. Determine the general solutions of the following systems of simultaneous differential 
equations: 


(a) 


dx i 
dt 

&x 2 

dt 


x\ + 3x2 + 5 1 , 
+ x 2 + e 3t . 



(b) 


clx i 

d t 
dx 2 

d t 


3xi + 2x 2 + t 2 , 
4xi + x 2 + e~ 2t . 


2. Determine the complete solutions of the following systems of differential equations, 
subject to the conditions given: 

(a) 


dx i 

d t 
dx 2 

d t 


—xi + 9 x 2 + 3 , 
llxi + x 2 + e 10t , 


given that X\ 


(b) 


- and x 2 = — ybr when t — 0. 


dx i 

d t 
dx 2 

dt 


t)X\ + 2 x 2 H - 2 t 2 H - t, 
-2xi + x 2 , 


given that X\ 


(c) 


p and x 2 = — P when t — 0. 

= 8x i + x 2 + sin t, 

= — 5xi + 6x 2 + cost, 


dx i 
dt 
dx 2 

dt 


given that X\ — 0 and x 2 = 0 when t = 0. 
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15.10.3 ANSWERS TO EXERCISES 


1. (a) 


(b) 



1 . 1 

'-25' 

1 

' 5 ' 

1 

'3' 

1 ^ 
1 CO 

+ 

a 

1 

. 15 . 

+ 8 

.-15. 

t 

5 

.2. 


e 


3 1 . 


A 


1 ' 

1 


e 5t + B 


i 2 

' 41 ' 

2 

-9' 

1 

' 1 ' 

2 1 

' 2 ' 

7 + I25 

.-84. 

+ 

to 1 
Or 1 

. 16 . 

t+ 5 

.-4. 

* + 7 

—5 


2. (a) 


(b) 


(c) 


7 

45 


' 1 ' 

e~ 10t + — 

' 9 ' 

e lw + — 

' 1 ' 

1 

H 

'O' 

wt 1 

' 9 ' 

.-1. 

900 

.11. 

100 

.-11. 

180 

.1. 

20 

.11. 


te 10t ] 



e 3t + 


1 

27 


' 5 ' 
-12 



1 

-22 



1' 

2 


t 


2 . 


1 

145 



■-r 

25 


cos 2 1 + 


' — 12 ' 
-10 


sin 2 1 ) 


+ 


' —17' 
-10 


sin t + 


1 

-25 
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UNIT 16.1 - LAPLACE TRANSFORMS 1 - DEFINITIONS AND RULES 


16.1.1 INTRODUCTION 

The theory of “Laplace Transforms” to be discussed in the following notes will be for 
the purpose of solving certain kinds of “differential equation” ; that is, an equation which 
involves a derivative or derivatives. 


The particular differential equation problems to be encountered will be limited to the two 
types listed below: 

(a) Given the “first order linear differential equation with constant coefficients” , 

dx r . . 

°d i + bX = 

together with the value of x when t — 0 (that is, x(0)), determine a formula for x in terms 
of t, which does not include any derivatives. 


(b) Given the “second order linear differential equation with constant coefficients” , 


d?x dx 

“d ¥ +b dt +cx 


f(t), 


together with the values of x and when t — 0 (that is, x(0) and x 7 (0)), determine a formula 
for x in terms of t which does not include any derivatives. 


Roughly speaking, the method of Laplace Transforms is used to convert a calculus problem 
(the differential equation) into an algebra problem (frequently an exercise on partial fractions 
and/or completing the square). 

The solution of the algebra problem is then fed backwards through what is called an “Inverse 
Laplace Transform” and the solution of the differential equation is obtained. 



CALCULUS 



LAPLACE 



The background to the development of Laplace Transforms would be best explained using 
certain other techniques of solving differential equations which may not have been part of 
earlier work. This background will therefore be omitted here. 
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DEFINITION 

The Laplace Transform of a given function /(f), defined for f > 0, is defined by the definite 
integral 

roc 

/ e~ st f(t) df, 

Jo 

where s is an arbitrary positive number. 

Notes 

(i) The Laplace Transform is usually denoted by L[f(t)\ or F(s), since the result of the 
definite integral in the definition will be an expression involving s. 

(ii) Although s is an arbitrary positive number, it is occasionally necessary to assume that it 
is large enough to avoid difficulties in the calculations; (see the note to the second standard 
result below). 

16.1.2 LAPLACE TRANSFORMS OF SIMPLE FUNCTIONS 

The following is a list of standard results on which other Laplace Transforms will be based: 

i. fit) = r. 



I n say. 


Hence, 


In = 


t n e~ st 


n r c 

J — / 

s Jo 


e~ st t n ~ l 


n 


df 'In— 1, 

S 


using the fact that e st tends to zero much faster than any other function of f can tend 
to infinity. That is, a decaying exponential will always have the dominating effect. 

We conclude that 

n (n — 1) (n — 2) 2 1 


But, 


Thus, 


s s s 

CO | 
CO | 

roo 

p st 

I 0 = / df = 

C- 

Jo 

—s 


T - m T 
.In — .In. 


1 

,S 


m = 


n\ 

o71+l 


Note: 

This result also shows that 
since 1 = f°. 
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2. f(t) = e~ at . 


f-OO POO 

F(s ) = / e~ st e~ at d t = / e ~( s+a)t d t 
Jo Jo 


Hence, 


3 — (s+o)t 

■(s + a) 


L[e" at ] = 


s + a 


s + a 


Note: 

A slightly different form of this result, less commonly used in applications to science 
and engineering, is 

£[e“] = H-jS 

s — 0 

but, to obtain this result by integration, we would need to assume that s > b to ensure 
that e~( s ~ b>i is genuinely a decaying exponential. 

3. f{t) = cos at. 


f(s) = r 

2o 


e st cos at d t = 


e st sin at 


a 


H — / e sin at df 


a Jo 


using Integration by Parts, once. 

Using Integration by Parts a second time, 


F(s) = 0 + - 
a 


e st cos at 


a 


s f°° ] 

/ e~ st cos at d t > , 

a Jo J 


which gives 


F(s) = ^ - S -.F(s). 


a 2 a 2 ' 


That is, 

In other words, 




s 2 + a 2 


L [cos at] = 


s 2 + a 2 
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4. f{t) = sin at. 

The method is similar to that for cos at, and we obtain 

T r . , a 

L sm at = — — — 
s 2 + a 2 

16.1.3 ELEMENTARY LAPLACE TRANSFORM RULES 

The following list of results is of use in finding the Laplace Transform of a function which is 
made up of basic functions, such as those encountered in the previous section. 


1. LINEARITY 

If A and B are constants, then 

L[Af(t) + Bg(t)} = AL[f(t)] + BL\g(t)}. 

Proof: 

This follows easily from the linearity of an integral. 

EXAMPLE 

Determine the Laplace Transform of the function, 

2 1 5 + 7 cos4t — 1. 


Solution 


L[2t 5 + 7 cos4t — 1] 



+ 7. 


s 

s 2 + 4 2 


1 

s 


240 7s _ 1 
s 6 + s 2 + 16 s 


2. THE TRANSFORM OF A DERIVATIVE 


The two results which follow are of special use when solving first and second order 
differential equations. We shall begin by discussing them in relation to an arbitrary 
function, /(£); then we shall restate them in the form which will be needed for solving 
differential equations. 


(a) 


L[f(t)]=sL[f(t)]~f( 0 ). 


Proof: 


L[f(t)} = e~ st f(t ) df = e~ st f(t) 


JO 


+ s 




f(t) d t 


using integration by parts. 

Thus, 

L[f(t)] = -m+8L[f(t)], 


as required. 
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(b) 


nr(t)] = s 2 mt)]-sm-f( o). 


Proof: 

Treating f"(t) as the first derivative of f(t), we have 

L\f"(t)] = 8L[f'(t)]-f'( 0), 


which gives the required result on substituting from (a) the expression for L[f'(t)\. 
Alternative Forms (Using L[x(t)} = X(s))i 

(i) 


L 


da; 

d t 


sX(s) — x(0). 


11 


d 2 a; 
d t 2 


= s 2 X(s) — sx( 0) — x'(0) or s[sX(s) — x(0)] — a/(0). 


3. THE (First) SHIFTING THEOREM 


L[e- at f(t)]=F(s + a). 


Proof: 


roc roc 

L[e~ at f(t )] = / e~ st e~ at f{t) d t = / e~^ +a)t f{t) d t, 

Jo Jo 

which can be regarded as the effect of replacing s by s + a in L[f(t)]. In other words, 
F(s + a). 

Notes: 

(i) Sometimes, this result is stated in the form 

L[e u f(t)] = F(s - b) 


but, in science and engineering, the exponential is more likely to be a decaying expo- 
nential. 

(ii) There is, in fact, a Second Shifting Theorem, encountered in more advanced courses; 
but we do not include it in this Unit (see Unit 16.5). 

EXAMPLE 

Determine the Laplace Transform of the function, e~ 2t sin 3 1. 

Solution 


First of all, we note that 


L[sin 3 1] 


3 

s 2 + 3 2 


3 

s 2 + 9 


Replacing s by (s + 2) in this result, the First Shifting Theorem gives 


L[e 2t sin3f] 


3 

(s + 2) 2 + 9 ' 
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4. MULTIPLICATION BY t 


mm) = - 


Proof: 

It may be shown that 

s [fwi= .c Ts ie ~“ mdt= r *= 

EXAMPLE 

Determine the Laplace Transform of the function, 

t cos 7 1. 


Solution 


L[t cos 7 1] 


dr s' 
ds _s 2 + 7 2 . 


(s 2 + 7 2 ).l — s.2s 
(s 2 + 7 2 ) 2 


- L[tf(t)\. 


s 2 -49 
s 2 + 49) 2 ' 
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THE USE OF A TABLE OF LAPLACE TRANSFORMS AND RULES 


For the purposes of these Units, the following brief table may be used to determine the 
Laplace Transforms of functions of t without having to use integration: 


m 

L[f{t )] = F(s) 

K (a constant) 

K 

s 


i 


s+a 

t n 

n! 

s n + i 

sin at 

a 

s 2 +a 2 

s inhat 

a 

s 2 —a 2 

cos at 

s 

s 2 +a 2 

cosh at 

s 

s 2 —a 2 

te~ at 

i 

(s+a) 2 

t sin at 

2 as 

(s 2 +a 2 ) 2 

t cos at 

(s 2 -a 2 ) 

(s 2 +a 2 ) 2 

sin at — at cos at 

2a 3 

(s 2 +a 2 ) 2 


16.1.4 FURTHER LAPLACE TRANSFORM RULES 
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4 . The Final Value Theorem 


lim f{t) = lim sF(s), 

t — >oo s — >0 

provided that the indicated limits exist. 

5 . The Convolution Theorem 

L\f f(T)g(t — T) dr] =F(a)G(a). 

16.1.5 EXERCISES 




e~ 2t (2 1 3 + 3t - 2) 


(d) 


cosh2f. sin t: 


e- at f(t), 


where L[f(t)} = F(s). 
3. (a) If 


x = t 3 e *, 


determine the Laplace Transform of without differentiating x more than once 
with respect to t. 

(b) If 


da; , 

*- +I = e ’ 


where x(0) = 0, show that 


A'( S ) = 


1 

s 2 — 1 


4. Verify the Initial and Final Value Theorems for the function 


fit) = te~ 3t . 


16.1.6 ANSWERS TO EXERCISES 

1. (a) 

6 4 1_ 

s 3 + s 2 s’ 

(b) 

6 6 3 1 

S 4 + S 3 + S 2 + S ’ 

(c) 

2 3 , i 

s-5 ~~ s — 1 + s + 7’ 

(d) 

6 3s 

s 2 + 9 s 2 + 4’ 
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(e) 


(f) 


(g) 


2. (a) 


(b) 


(c) 


(d) 


(e) 


12s 

(s 2 + 36) 2 ’ 


1 : 1 
(s- l) 2 + (s + 2) 2 ’ 


1 ri s 

2 [s ~ s 2 + 4. ' 


s + 3 

(s + 3) 2 + 25’ 


2 


12 | 3 2 

(s + 2) 4 + (s + 2) 2 “ s~+~2 ’ 


1 1 
2 [(s- 2) 2 + l 


1 

(s + 2)2 + 1 


(s + a)F(s + a) — /( 0). 


3. (a) 


6s 2 

(ST!) 1 ’ 


(b) 


On the left hand side, use the formula for L 


dx 

d£ 


4. 


lim f{t) =0 and lim /(f) = 0 

t — >0 t — xx) 
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UNIT 16.2 - LAPLACE TRANSFORMS 2 
INVERSE LAPLACE TRANSFORMS 

In order to solve differential equations, we now examine how to determine a function of the 
variable, t, whose Laplace Transform is already known. 

16.2.1 THE DEFINITION OF AN INVERSE LAPLACE TRANSFORMS 

A function of t, whose Laplace Transform is the given expression, F(s), is called the “Inverse 
Laplace Transform” of f(t) and may be denoted by the symbol 

i-'lfW]. 

Notes: 

(i) Since two functions which coincide for t > 0 will have the same Laplace Transform, we 
can determine the Inverse Laplace Transform of F(s) only for positive values of t. 

(ii) Inverse Laplace Transforms are linear since 

L~ l [AF(s) + BG(s )] 

is a function of t whose Laplace Transform is 

AF(s) + BG(s)] 

and, by the linearity of Laplace Transforms, discussed in Unit 16.1, such a function is 

AL~ 1 [F(s)] + BL~ l [G(s)]. 

16.2.2 METHODS OF DETERMINING AN INVERSE LAPLACE 
TRANSFORM 

The type of differential equation to be encountered in simple practical problems usually 
lead to Laplace Transforms which are “rational functions of s” . We shall restrict the 
discussion to such cases, as illustrated in the following examples, where the table of standard 
Laplace Transforms is used whenever possible. The partial fractions are discussed in detail, 
but other, shorter, methods may be used if known (for example, the “Cover-up Rule” and 
“Keily’s Method”; see Unit 1.9) 

EXAMPLES 

1. Determine the Inverse Laplace Transform of 



Solution 

f(t) = + 4e 2t t> 0 


1 



2. Determine the Inverse Laplace Transform of 


2s + 3 _ 2s + 3 
s 2 + 3s s(s + 3) 


Solution 

Applying the principles of partial fractions, 

2s + 3 _ A B 
s(s + 3) s s + 3’ 


giving 


2s + 3 = A(s + 3) + Bs 


Note: 

Although the s of a Laplace Transform is an arbitrary positive number, we may tem- 
porarily ignore that in order to complete the partial fractions. Otherwise, entire partial 
fractions exercises would have to be carried out by equating coefficients of appropriate 
powers of s on both sides. 

Putting s = 0 and s = — 3 gives 



3 = 3 A and — 3 = —3 B 

so that 

A = 1 and B — 1. 

Hence, 

F(s) = 1 + -L- 
s s + 3 

Finally, 

fit) = 1 + e~ 3t t > 0. 

Determine the Inverse Laplace Transform of 


F « - ^9- 


Solution 


m 


i 

- sin 3 1 t > 0. 
o 


4. Determine the Inverse Laplace Transform of 


F(s) 


s + 2 
s 2 + 5 


Solution 


/(f) = cos ty/5 + 


—j= sin t \/ 5 
V5 


t > 0. 


2 



5. Determine the Inverse Laplace Transform of 


F(s) 


3s 2 + 2s + 4 
(s + l)(s 2 + 4) ' 


Solution 

Applying the principles of partial fractions, 

3s 2 + 2s + 4 _ A Bs + C 
(s + l)(s 2 + 4) s + l + s 2 + 4 


That is, 


3s 2 + 2s + 4 = A(s 2 + 4) + (Bs + C)(s + 1). 


Substituting s = —1, we obtain 


5 = 5 A which implies that A — 1. 


Equating coefficients of s 2 on both sides, 

3 = A + B so that B — 2. 


Equating constant terms on both sides, 

4 = 4 A + C so that C — 0. 

We conclude that 


1 2s 

F ( s ) = — WT + 


s + 1 s 2 + 4 


Hence, 


n*) = ,_ , 


f(t) — e 4 + 2 cos 2 t t > 0 . 

6. Determine the Inverse Laplace Transform of 

1 

(s + 2 y 

Solution 

Using the First Shifting Theorem and the Inverse Laplace Transform of 

f(t) = —t 4 e~ 2t t > 0. 
j \ j 24 

7. Determine the Inverse Laplace Transform of 


F(s) = 


(s — 7) 2 + 9' 


Solution 

Using the First Shifting Theorem and the Inverse Laplace Transform of s2 ° 2 

/(f) = e 7t sin3f t > 0. 


, we obtain 


, we obtain 
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8. Determine the Inverse Laplace Transform of 


F(s) 


s 

s 2 + 4s + 13 


Solution 

The denominator will not factorise conveniently, so we complete the square, giving 

F(S) = (s + 2) 2 + 9 ' 


In order to use the First Shifting Theorem, we must try to include s+2 in the numerator 
so we write 


, . (s + 2) — 2 s + 2 2 3 

^ “ (s + 2) 2 + 9 ““ (s + 2) 2 + 3 2 _ 3 (s + 2) 2 + 3 2 ’ 


Hence, 


f(t) = e 2t cos3 1 — 


V 2t sin3 1 

o 


2< [3 cos 3t — 2 sin 3t] t> 0. 
o 


9. Determine the Inverse Laplace Transform of 


F(s) 


8(s + l) 
s(s 2 + 4s + 8) ’ 


Solution 

Applying the principles of partial fractions, 

8( s + 1) _ A Bs + C 

s(s 2 + 4s + 8) s s 2 + 4s + 8 


Multiplying up, we obtain 

8(s + 1) = A(s 2 + 4s + 8) + (Bs + C)s. 

Substituting s = 0 gives 

8 = 8A so that A — 1. 
Equating coefficients of s 2 on both sides, 

0 = A + B which gives B = — 1. 

Equating coefficients of s on both sides, 

8 = 4 A + C which gives C = 4. 


Thus, 


F(s) 


1 -s + 4 
s s 2 + 4s + 8 ’ 
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The quadratic denominator will not factorise conveniently, so we complete the square 
to give 

1 — s + 4 

s + {s + 2) 2 + 4’ 

which, on rearrangement, becomes 


s (s + 2) 2 + 2 2 

Thus, from the First Shifting Theorem, 

/(f) = 1 — e~ 2t cos 2 f + 3e 


+ 


— 2t 


6 

(s + 2) 2 + 2 2 ’ 


sin2f f > 0. 


10. Determine the Inverse Laplace Transform of 


F(s) 


s + 10 

s 2 -4s - 12' 


Solution 

This time, the denominator will factorise, into (s + 2)(s — 6), and partial fractions give 

s + 10 A t B 

(s + 2)(s — 6) = s + 2 + s — 6 


Hence, 

s + 10 = A(s — 6) + B(s + 2) 

Putting s = —2, 

8 = — 8H giving A = — 1. 

Putting s = 6, 

16 = 8 B giving B — 2. 

We conclude that 

-1 2 

F(s) = 1 . 

V ; s + 2 s-6 

Finally, 

/(f) = —e~ 2t + 2e 6t t > 0. 


However, if we did not factorise the denominator, a different form of solution could be 
obtained as follows: 

( S - 2) + 12 _ *-2 4 

■’ (s - 2) 2 - 4 2 (s — 2) 2 — 4 2 (s-2) 2 +4 2 ' 

Hence, 

/(f) = e 2t [coshff + 3sinh4f] f > 0. 
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11. Determine the Inverse Laplace Transform of 


FtS> (s - l)(s + 2) 

Solution 

The Inverse Laplace Transform of this function could certainly be obtained by using 
partial fractions, but we note here how it could be obtained from the Convolution 
Theorem. 

Writing 

F{s) = (I^T)-(I+2)’ 

we obtain 

/(f) = T e T .e~ 2 ^ d T = f e( 3T - 2 *) dT = . 

J W Jo Jo [ 3 J o 

That is, 

P t p -2 1 

™ = I— (>a 

16.2.3 EXERCISES 

Determine the Inverse Laplace Transforms of the following rational functions of s: 

1 

(^i? ; 

l 

(s + l) 2 + 4’ 
s + 2 

(s + 2) 2 + 9’ 
s — 2 


!• (a) 
(b) 


(d) 


6 



(e) 


1 

(s 2 + 4) 2 ’ 


(f) 

(g) 

(h) 

(i) 
0 ) 

(k) 

( l ) 


s + 1 

s 2 + 2s + 5 ’ 
s-3 

s 2 -4s + 5’ 
s — 3 

(s-ms-2y 

5 

(s + l)(s 2 — 2s + 2) ’ 
2s -9 

(s — 3) (s + 2) 

3 

s(s 2 + 9) ’ 

2s - 1 

(s — l)(s 2 + 2s + 2) ’ 


2. Use the Convolution Theorem to obtain the Inverse Laplace Transform of 

s 

(s 2 + l) 2 ‘ 


16.2.4 ANSWERS TO EXERCISES 


1. (a) 


te* t > 0; 


7 



(b) 

1 , 


-e f sin 2 1 t > 0; 

(c) 

e~ 2 *cos3 1 t > 0; 

(d) 

e 3t \t + 1 t 2 ] t > 0; 


L 2 J 

(e) 

[sin 2t — 2£ cos 2t] t > 0; 

16 

(f) 

e^*cos2 1 t > 0; 

(g) 

e 2t [cost — sint] t > 0; 

(h) 

2te t + e* - e 2t t > 0; 


e~ l + e*[2 sint — cost] t > 0; 

(j) 

^[13e" 2t -3e 3i ] t > 0; 
0 

(k) 

1. 

-[1 — cos 3 1] t > 0; 
o 

(1) 

1 

-[e* — cost + 8e _i sin t] t> 0 

0 

2. 

1 


-tsint t > 0. 
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UNIT 16.3 - LAPLACE TRANSFORMS 3 - DIFFERENTIAL EQUATIONS 

16.3.1 EXAMPLES OF SOLVING DIFFERENTIAL EQUATIONS 

In the work which follows, the problems considered will usually take the form of a linear 
differential equation of the second order with constant coefficients. 

That is, 


d 2 a; da; 

“a ¥ + b it +cx 


/(*)• 


However, the method will apply equally well to the corresponding first order differential 
equation, 

da; . 

a — — b bx = t(t). 
dt J w 

The technique will be illustrated by examples. 

EXAMPLES 


1. Solve the differential equation 


d 2 a; 

dt 2 



+ 13a; 


0, 


given that x = 3 and % = 0 when t = 0. 

Solution 

Taking the Laplace Transform of the differential equation, 

s[sX(s) - 3] + 4[sX(s) - 3] + 13X (s) 


0 . 


Hence, 

giving 


(s 2 + 4s + 13)X(s) = 3s + 12, 


X(s) = 


3s + 12 


s 2 + 4s + 13 

The denominator does not factorise, therefore we complete the square to obtain 


X(s) 


3s + 12 
(s + 2) 2 + 9 


3(s + 2) + 6 _ s + 2 3 

(s + 2) 2 + 9 “ (s + 2) 2 + 9 + (s + 2) 2 + 9' 


1 



Thus, 


x(t) — 3e i/: cos3t + 2e 2i sin3t t> 0 


or 


x(t) = e 2t [3cos3t + 2sin3t] t > 0. 
2. Solve the differential equation 

d!| +6 ^ + te = 50 sm t , 

at 2 at 

given that x — 1 and % = 4 when t = 0. 

Solution 

Taking the Laplace Transform of the differential equation, 

s[sX(s) - 1] - 4 + 6[sX(s) - 1] + 9X(s) = 


50 


s 2 + 1’ 


giving 


50 

(s 2 + 6s + 9)A (s) = — » — — + s + 10. 

s 2 + 1 


Hint: Do not combine the terms on the right into a single fraction - it won’t help 
Thus, 


*(s) = 


50 


+ 


s + 10 


(s 2 + 6s + 9)(s 2 + 1) s 2 + 6s + 9 


or 


X(s) = 


50 


+ 


s + 10 


(s + 3) 2 (s 2 + l) (s + 3) 2 ' 

Using the principles of partial fractions in the first term on the right, 

50 


A B | Cs + D 

(s + 3) 2 (s 2 + l) = (s + 3) 2 + s + 3 + s 2 + l ' 


Hence, 


50 = A(s 2 + 1) + B{s + 3)(s 2 + 1) + (Cs + D)(s + 3) 2 . 
Substituting s = —3, 

50 = 10 A giving A — 5. 

Equating coefficients of s 3 on both sides, 

0 = B + C. (1) 
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Equating the coefficients of s on both sides (we shall not need the s 2 coefficients in this 
example) , 

0 = B + 9C + 6D. (2) 

Equating the constant terms on both sides, 

50 = A + 3B + 9D = 5 + 3B + 9D. (3) 


Putting C = —B into (2), we obtain 


- 8B + 6D = 0, (4) 

and we already have 

3B + 9D = 45. (3) 


These last two solve easily to give B — 3 and D = 4 so that C = —3. 
We conclude that 


50 _ 5 3 

(s + 3) 2 (s 2 + 1) = (s + 3) 2 + s + 3 


-3s + 4 
s 2 + 1 


In addition to this, we also have 

s + 10 _ s + 3 7 1 7 

(s + 3) 2 = (s + 3) 2 + (s + 3) 2 = s + 3 + (s + 3)2 ’ 


The total for X(s) is therefore given by 

12 


X(s) = 


4 s t 1 

— 3.^r— r + 4. 


(s + 3) 2 s + 3 s 2 + l s 2 + l 


Finally, 


x(t) = 12 te~ 3t + 4e _3f — 3 cos t + 4 sin t t > 0. 
3. Solve the differential equation 

d 2 a: , dx , 

de + dt “ ’ 

given that x — 1 and ^ = —2 when t — 0. 

Solution 

Taking the Laplace Transform of the differential equation, 


s[sX(s) - 1] + 2 + 4[sX(s) - 1] - 3X(s) = 


s — 1 


3 



This gives 


(s 2 + 4s — 3) A (s) — — + s + 2 


Therefore, 


*(s) = 


s — 1 


+ 


s + 2 


(s — l)(s 2 + 4s — 3) s 2 + 4s — 3’ 
Applying the principles of partial fractions, 

4 _ A Bs + C 

(s — l)(s 2 + 4s — 3) s — 1 + s 2 + 4s — 3 

Hence, 

4 = A(s 2 + 4s - 3) + (Bs + C)(s - 1). 
Substituting s = 1, we obtain 

4 = 2 A; that is, A = 2. 

Equating coefficients of s 2 on both sides, 

0 = A + B, so that B = —2. 
Equating constant terms on both sides, 

4 = —3 A — C , so that C = —10. 

Thus, in total, 


w N 2 -s - 8 

A ( s )= + 


-s — 8 


or 


X(s) = 


s — 1 s 2 + 4s — 3 s — 1 (s + 2) 2 — 7 
2 s + 2 6 


Finally, 


s — 1 (s + 2) 2 -7 (s + 2) 2 — 7 


6 


x(t) = 2e t — e 2t cosh tV7 i=e 2t sinh tV7 t > 0. 

V7 


16.3.2 THE GENERAL SOLUTION OF A DIFFERENTIAL EQUATION 


On some occasions, we may either be given no boundary conditions at all; or else the bound- 
ary conditions given do not tell us the values of x(0) and 2/(0). 

In such cases, we simply let x(0) = A and a/(0) — B to obtain a solution in terms of A and 

B called the ’’general solution”. 

If any non-standard boundary conditions are provided, we then substitute them into the 
general solution to obtain particular values of A and B. 


4 



EXAMPLE 


Determine the general solution of the differential equation 

d 2 x 


d t 2 


+ 4x = 0 


and, hence, determine the particular solution in the case when x(|) = —3 and x'( 

Solution 

Taking the Laplace Transform of the differential equation, 


That is, 
Hence, 


s(sX(s) - A) -B + 4X(s) = 


(s 2 + 4)X(s) = As + B. 


Vi \ — As + B _ s 
X (s) = = A + B. 

s z + 4 s z + 4 


0 . 


1 

s 2 + 4 


This gives 


x(t) = A cos 2t + — sin 2t t > 0; 

but, since A and B are arbitrary constants, this may be written in the simpler form 

x(t) = A cos 2 1 + B sin 2 1 t > 0, 


in which A has been rewritten as H. 

To apply the boundary conditions, we require also the formula for x'(t), namely 

x'(t) = — 2 A sin 2t + 2B cos 2t. 


Hence, — 3 = — A and 10 = — 2 B giving A = 3 and B = - 5. 
Therefore, the particular solution is 

x(t) = 3 cos 2t — 5 sin 2 1 t > 0. 
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16.3.3 EXERCISES 


1. Solve the following differential equations subject to the conditions given: 
(a) 


d 2 x „ da; 
d t 2 

given that a;(0) = 3 and x'(O) = 1; 


d^- 2 * +5 " = 0 ' 


(b) 


, d 2 a; , da; 

4 W + 4 m + ^°- 


given that a;(0) = 4 and a;'(0) = 1; 


(d) 


d 2 a; „ da; 
dt 2 

given that a;(0) = 3 and a;'(0) = 1; 

d 2 a; 


d« 2 +2 di~ 8x = 2t ‘ 


-4x = 2e 2t , 


dt 2 

given that a;(0) = 1 and a;'(0) = 10.5; 


d 2 a; 

dt 2 

given that a;(0) = 1 and a;'(0) = 2. 
Hint: cos 2 1 = 2cos 2 t — 1. 


+ 4a; = 3cos 2 t, 


2. Determine the particular solution of the differential equation 


d 2 a; da; 

dt 2 dt 

in the case when a;(0) = 2 and a;(3) = —1. 


e\t - 3) 


Hint: 

Since a;(0) is given, just let a/(0) = B to obtain a solution in terms of B; then substitute 
the second boundary condition at the end. 


6 



16.3.4 ANSWERS TO EXERCISES 


1. (a) 


giving 


(b) 


giving 


(c) 


giving 


(d) 


giving 


(e) 


giving 


2 . 


X(s) 


3s -5 
s 2 -2s + 5’ 


x(t) = e*(3cos2t — sin2t) t > 0; 


w N 3 4 3 

x(s) ~ 7+J + (J+F’ 


x(t) = 4 e _ 2 4 + 3te~ ^ = e~^[A + 3 1] t> 0; 


_ 27 1 39 1 1 1 1 1 

^ ~~ 12 s - 2 + 48’ s + 4 ~ 4V ~~ 16 V 


. , 27 2 . 39 _ 4 , 1 1 

X(t) = l2 e + 48 C " _ 4 t_ 16 t>0; 




x(t) = -te 2t + 3e 2< - 2e" 2t t > 0; 


w . 3 s 3 15s 

X (S) = o' 


2'(s 2 + 4) 2 8's 8’s 2 + 4 s 2 + 4’ 


3 3 5 

x(t) = -i sin 2t + - + - cos 2t + sin 2t t > 0. 


x(t) = 3e‘ — V — 1 t > 0. 
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UNIT 16.4 - LAPLACE TRANSFORMS 4 
SIMULTANEOUS DIFFERENTIAL EQUATIONS 

16.4.1 AN EXAMPLE OF SOLVING SIMULTANEOUS 
LINEAR DIFFERENTIAL EQUATIONS 

In this Unit, we shall consider a pair of differential equations involving an independent 
variable, t, such as a time variable, and two dependent variables, x and y , such as electric 
currents or linear displacements. 

The general format is as follows: 


dx dy 

a \-rr + Oi— + cix + diy 
dt d t 

dx dy 

a >2 -r- + 02 - 7 - + c 2 x + d 2 y 


h(t), 


To solve these equations simultaneously, we take the Laplace Transform of each equation 
obtaining two simultaneous algebraic equations from which we may determine X(s) and 
y(s), the Laplace Transforms of x(t) and y(t) respectively. 

EXAMPLE 

Solve, simultaneously, the differential equations 


dy 

dt 

da; 

dt 


+ 2x 


-2 y 


1 + 1 , 


given that x(0) = 1 and y( 0) = 2. 

Solution 

Taking the Laplace Transforms of the differential equations, 


sY(s) -2 + 2X(s) = - 1 - 

s — 1 


1 



sA^(s) — 1 — 21' (s) — — I — 


That is, 


2 X(s) + sY(s) = + 2, (1 

s — 1 

sX(s)— 21'(s) = — I — — + 1. (2) 


Using (1) x 2 + (2) x s, we obtain 


(4 + s 2 )X(s) = -+4 + 1H fs. 

s — 1 s 


Hence, 


X(s) = 


(s — l)(s 2 + 4) s 2 + 4 s(s 2 + 4) s 2 + 4’ 


Applying the methods of partial fractions, this gives 


_ 2 1 7 s 23 1 11 

^ ~~ 5’s - 1 + 20 A 2 + 4 + ~5~’s 2 + 4 + 4 s 


Thus, 


2 17 23 

x(t) = -e* + - + — cos 2t + — sin 2 t t > 0. 

c) T ZU 1U 


We could now start again by eliminating x from equations (1) and (2) in order to calculate 
y, and this is often necessary; but, since 


„ da; 


in the current example, 


117 23 t 

v( - t) = 5 e '-2~20 SiD2t+ W COS2t ~2 (> °- 


2 



16.4.2 EXERCISES 


Use Laplace Transforms to solve the following pairs of simultaneous differential equations, 
subject to the given boundary conditions: 


1. 


given that x 

2 . 


given that x 


1 and y 


dx 
d t 


+ 2 y 


dy 

dt 


+ 3 y 


0 when t — 0. 


3 and y 


dx 

dt 


- V 


dy 

dt 

4 when t = 


+ x 

0 . 



x, 


sin t, 
cos t, 


3. 


given that x 
4. 


given that x 


dx 

^ + 2x-3y = 1 , 

d y , „ 

d ~t~ x + 2y = e 


—2t 


0 and y = 0 when t — 0. 


da; 

dt 

dy 

dt 

1 and y — 0 when t — 0. 


2 y, 

8x, 


3 



5 . 


10^ - 3^ + 6x + 5y = 0, 
at at 

da; dy . 

2 dt-Tt +2x + v = 2e ~' 


given that x = 2 and y — — 1 when t = 0. 


6 . 


da; dy 

dt dt 

5 d£ +2 d, 

dt dt 


+ 6 y 
+ 6a; 


0, 

0, 


given that x = 3 and y = 0 when t = 0. 


da; 

dt 

dy 

dt 

d£ 

dt 


2 y, 


2z, 

2x, 


given that x — 1, y — 0 and z = — 1 when t = 0. 
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16.4.3 ANSWERS TO EXERCISES 


x — (2 1 + l)e 1 and y = te . 


x — (t + 4) sin t + 3 cos t and y — (t + 4) cos t — 3 sin t. 


x = 2 — e 2< [1 + x/3 sinht\/3 + cosh 


and 


y=l-e 


- 2 1 


cosht\/3 H 7 = sinht\/3 

\/3 


5. 


6 . 


a: = sinh4t and y — 2 cosh At. 


x = 4 cos t — 2e and y = e — 2 cos t 


x = 2e t + e 2t amd y = e 1 — e 


-t - 2 1 


x = e 




— 7 = sin tV 3 + cos 

a/3 


and 


y = — 1 sint\/3 


V3 


z = e 


-t 


V3 


sm — cos t\Z3 
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UNIT 16.5 - LAPLACE TRANSFORMS 5 
THE HEAVISIDE STEP FUNCTION 

16.5.1 THE DEFINITION OF THE HEAVISIDE STEP FUNCTION 


The Heaviside Step Function, H(t), is defined by the statements 


H(t) = 


0 for t < 0; 

1 for t > 0. 


Note: 

H(t) is undefined when t = 0. 



EXAMPLE 

Express, in terms of H(t), the function, /(£), given by the statements 

f(t) = (° for t < T; 

J[) ll for t > T. 


1 



Solution 


fit) 

1 


w 


T 


i 


Clearly, f(t) is the same type of function as H(t), but we have effectively moved the origin 
to the point (T, 0). Hence, 

f(t) = H(t-T). 


Note: 

The function H(t—T) is of importance in constructing what are known as “pulse functions” 
(see later). 

16.5.2 THE LAPLACE TRANSFORM OF H(t - T) 


From the definition of a Laplace Transform, 

roo 

L[H(t -T)}= / e~ st H{t - T) 
Jo 

rT roo 

= / e~ st .0df+ / e~ st .ldt 
Jo Jt 


d t 


n —st 


— S 


°C e - S T 


Note: 

In the special case when T = 0, we have 

s 

which can be expected since H(t) and 1 are identical over the range of integration. 
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16.5.3 PULSE FUNCTIONS 


If a < b, a “rectangular pulse”, P(t), of duration, b — a, and magnitude, k, is defined by 
the statements, 


p(t) = 


k for a < t < 6; 

0 for t < a or t > b. 



We can show that, in terms of Heaviside functions, the above pulse may be represented by 

Pit) = k[H(t - a) - H(t - 6)]. 

Proof: 

(i) If t < a, then H(t — a) = 0 and H(t — b) = 0. Hence, the above right-hand side = 0. 

(ii) If t > b, then H(t — a) = 1 and H(t — b) = 1. Hence, the above right-hand side = 0. 

(iii) If a < t < b, then H(t — a) = 1 and H(t — b) = 0. Hence, the above right-hand side = k. 

EXAMPLE 

Determine the Laplace Transform of a pulse, P(t), of duration, b — a, having magnitude, k. 

Solution 


p 

L[P(t)] = k — 
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Notes: 


(i) The “strength” of the pulse, described above, is defined as the area of the rectangle 
with base, b — a, and height, k. That is, 

strength = k(b — a). 


(ii) In general, the expression, 

[H(t-a)-H(t-b)]f(t), 

may be considered to “switch on” the function, fit), between t — a and t — b but “switch 
off” the function, f(t), when t < a or t > b. 

(iii) Similarly, the expression, 

H(t — a) fit), 

may be considered to “switch on” the function, fit), when t > a but “switch off” the 
function, fit), when t < a. 

For example, the train of rectangular pulses, Q(t), in the following diagram: 


Qit) 


TB- 


a 3 a 5 a 


may be represented by the function 

Q{t) = k{[H(t) - H{t - a)] + [Hit - 2a) - Hit - 3a)] + [H(t - 4a) - H(t - 5a)] + } . 
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16.5.4 THE SECOND SHIFTING THEOREM 
THEOREM 


L[H(t-T)f(t-T)} = e- sT L[f(t)}. 


Proof: 

Left-hand side = 

p oo 

/ e~ st H{t - T)f(t - T) d t 
Jo 

pT p oo 

= / 0 d t+ e~ st f{t - T) d t 

Jo Jt 

poo 

= J T e ~ st f(t - T ) d t. 

Making the substitution u = t — T, we obtain 

/»nr> 

e s(u+T)f( u ) d U 


= e 


-sT 


/ 0 

roo 


roo 

/ e— /(u)du = e-^L[/(t)]. 


EXAMPLES 

1. Express, in terms of Heaviside functions, the function 

f(t) = { ~ x ) 2 for 1 > !; 

JK J l 0 for 0 < t < 1. 

and, hence, determine its Laplace Transform. 

Solution 

For values of t > 0, we may write 

Therefore, using T = 1 in the second shifting theorem, 

= e-‘L{f] = e-.J 
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2. Determine the inverse Laplace Transform of the expression 


s 2 + 4s + 5 

Solution 

First, we find the inverse Laplace Transform of the expression, 

1 _ 1 
s 2 + 4s + 5 = (s + 2) 2 + l' 

From the first shifting theorem, this will be the function 

e~ 2t sint, t > 0. 

From the second shifting theorem, the required function will be 

H(t - 7)e- 2{t ~ 7) sin (t - 7), t > 0. 


16.5.5 EXERCISES 


1. (a) For values of t > 0, express, in terms of Heaviside functions, the function, 


m = 


e t for 0 < t < 3; 
0 for t > 3. 


(b) Determine the Laplace Transform of the function, f(t), in part (a). 


2. For values of t > 0, express, in terms of Heaviside functions, the function, 


_ / h(t) for 0 <t <a; 

■' \ f 2 (t ) for t > a. 

3. For values of t > 0, express the following functions in terms of Heaviside functions: 


(a) 

(b) 


m 

m = 


it 2 

for 0 < t < 2; 

14 1 

for t > 2. 

sin t 

for 0 < t < 7r; 

sin 2 1 

for 7 t < t < 27t: 

sin 3 1 

for t > 2n. 
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4. Use the second shifting theorem to determine the Laplace Transform of the function, 


/(<) = t 3 H (t - 1). 

Hint: 

Write t 3 = [{t — 1) + l] 3 . 

5. Determine the inverse Laplace Transforms of the following: 
(a) 


(b) 

8e~ 3s _ 
s 2 + 4’ 


s 2 + 3s + 2 ’ 

(d) 


s 2 — 2s + 5" 

6. Solve the differential equation 

r[ 2 r 

- + 4x = H(t-2), 
given that x = 0 and = 1 when t — 0. 

16.5.6 ANSWERS TO EXERCISES 


1. (a) 

(b) 




L[f(t)\ 


g— 3(s+l) 

S + 1 


7 



2 . 


3. (a) 


f(t) = fi(t)[H(t) - Hit - a)] + f 2 (t)H(t - a)]. 
fit) = t 2 [Hit) - H[t - 2)] + 4 tH(t - 2); 


(b) 


4. 


fit) = sin t[Hit) — Hit — 7r )] + sin 2 t[Hit — n) — Hit — 27r)] + sin 3 t[Hit — 2n). 


L[f(t )] 


r 6 6 3 11 

~4 + + ~2 ”b “ 

Ls 4 s 6 s z si 


e 


— S 


5. (a) 

(b) 

(c) 

(d) 


6 . 


Hit - 2 )(*-: 2 ); 


4 Hit — 3)sin2(t — 3); 


Hit- 2)[2e" 2 b- 2) -e" (t - 2) ; 
— 3)e (i ~ 3 ) sin 2 (t — 3). 


1 1 

x = - sin 21 + — 2) [1 


cos 2 (t — 2)]. 
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UNIT 16.6 - LAPLACE TRANSFORMS 6 
THE DIRAC UNIT IMPULSE FUNCTION 

16.6.1 THE DEFINITION OF THE DIRAC UNIT IMPULSE FUNCTION 

A pulse of large magnitude, short duration and finite strength is called an “impulse” . In 
particular, a “unit impulse” is an impulse of strength 1. 

ILLUSTRATION 

Consider a pulse, of duration a, between t — 0 and t = a, having magnitude, A The 
strength of the pulse is then 1. 


1 

a 

( 

T 


V 

7 

a 


From Unit 16.5, this pulse is given by 

Hjt) - Hjt - a) 
a 


If we now allow a to tend to zero, we obtain a unit impulse located at t — 0. This leads to 
the following definition: 

DEFINITION 2 


The “Dirac unit impulse function” , 5{t) is defined to be an impulse of unit strength 
located at t — 0. It is given by 


m 


H[t) -H[t -a) 
lim 

a— >o a 


1 



Notes: 


(i) An impulse of unit strength located at t — T is represented by 8(t — T). 

(ii) An alternative definition of the function 8(t — T) is as follows: 


and 


0 for t ^ T; 

oo for t — T. 


rT+h 

/ S(t — T)dt— 1. 

JT-h 


lim 

h—> 0 Jt - 


2 h 


0 


: : J, 

T-h T + h 


THEOREM 



— T)dt — f(T) if a < T < b. 


Proof: 


Since 8(t — T) is equal to zero everywhere except at t — T, 
formula reduces to 


rT+h 

lim/ f(t)5(t-T) dt. 
h^oJr-h 


the left-hand side of the above 


But, in the small interval from T — h to T + h, f(t) is approximately constant and equal to 
/(T). Hence, the left-hand side may be written 


f(T) 


rT+h 

lim / S(T - T) dt 

h^O J T -h 
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which reduces to f(T), using note (ii) in the definition of the Dirac unit impulse function. 

16.6.2 THE LAPLACE TRANSFORM OF THE DIRAC UNIT IMPULSE 
FUNCTION 

RESULT 


m* - T)} = e- T ', 


and, in particular, 


nm = i. 


Proof: 

From the definition of a Laplace Transform, 

roo 

L[S(t — T) — e~ st 5(t - T) d t. 
Jo 


But, from the Theorem discussed above, with fit) = e st , we have 

L[S(t - T)] = e~ sT . 


EXAMPLES 


1. Solve the differential equation, 

H x 

3- +4 x = 6(t), 

given that x = 0 when t = 0. 

Solution 

Taking the Laplace Transform of the differential equation, 


That is, 


Hence, 


3s A (s) +4X(s) = 1. 


X(s) = 


1 1 


3s + 4 3 s + 


/ x 1 _4t 

x(t) = -e 3 . 


4 ’ 

3 
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2. Show that, for any function, f(t), 

roc 

/ f(t)5'(t-a) dt = -f(a). 

Jo 

Solution 

Using Integration by Parts, the left-hand side of the formula may be written 

r oo 

[f{t)5{t - a)]g° - / - a) d t. 

Jo 

The first term of this reduces to zero, since 5{t — a) is equal to zero except when t = a. 
The required result follows from the Theorem discussed earlier, with T = a. 


16.6.3 TRANSFER FUNCTIONS 

In scientific applications, the solution of an ordinary differential equation having the form 


d 2 x dx 

a &e +h 7 u +cx 


f(t), 


is sometimes called the “response of a system to the function /(£)”. 

The term “system” may, for example, refer to an oscillatory electrical circuit or a mechanical 
vibration. 

It is also customary to refer to f(t) as the “input” and x{t) as the “output” of a system. 

In the work which follows, we shall consider the special case in which x = 0 and = 0 when 
t — 0; that is, we shall assume zero initial conditions. 

Impulse response function and transfer function 

Consider, for the moment, the differential equation having the form, 


d 2 u d u 

“d ¥ + b dt +cu 


m 


Here, we refer to the function, u(t), as the “impulse response function” of the original 
system. 

The Laplace Transform of its differential equation is given by 

(as 2 + bs + c)U(s) = 1. 
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Hence, 


U(s) 


1 

as 2 + bs + c 


which is called the “transfer function” of the original system. 

EXAMPLE 

Determine the transfer function and impulse response function for the differential equation, 


3- +4 x = f{t), 

assuming zero initial conditions. 

Solution 

To find U(s) and u(t), we have 


3 


d u 
d t 


+ 4 u 


St , 


so that 


(3s + 4)H(s) = 1 


and, hence, the transfer function is 


U(s) 


1 

3s + 4 


1 1 
3'^ + T 


Taking the inverse Laplace Transform of U (s) gives the impulse response function, 


u{t) 



4 1 
3 
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System response for any input 

Assuming zero initial conditions, the Laplace Transform of the differential equation 


d^x 

a— — + bx + cx 
at 2 


m 


is given by 


(as 2 + bs + c)A"(s) = F(s ), 


which means that 


X(s) 


Fjs) 

as 2 + bs + c 


F(s).U(s). 


In order to find the response of the system to the function f(t), we need the inverse Laplace 
Transform of F(s).U(s ) which may possibly be found using partial fractions but may, if 
necessary, be found by using the Convolution Theorem referred to in Unit 16.1 

The Convolution Theorem shows, in this case, that 


L 


r f(T).u(t — T) d T 
J o 


F(s).(J(s)\ 


in other words, 


i' 1 |P(i).!'(»)]= [' f(T).u(t-T)iT. 
Jo 
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EXAMPLE 


The impulse response of a system is known to be u(t) = 10d - . 
Determine the response, x(t), of the system to an input of f(t) = sin3t. 
Solution 
First, we note that 


10 , . 3 

U (s) = -T- , ^ and F(s) = 


3(s + 1) 


s 2 + 9' 


Hence, 


A( S ) = 


10 


1 -s + 1 


(s + l)(s 2 + 9) s + 1 s 2 + 9 ’ 


using partial fractions. 
Thus 


x(t) = e — cos 3 1 + - sin 3 1 t > 0. 


Alternatively, using the Convolution Theorem, 


x(t) — [ sin 3 T. 
Jo 


lOe 0 

—3— dr; 


but the integration here can be made simpler if we replace sin3T by e^ T and use the 
imaginary part, only, of the result. 

Hence, 


x(t) = Im (£ ^e- f . e ( 1+ ^ T dr) 


= In 


10 


—t 


e (l+j'3 )T- 

I+J'3 
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/!° 

i 

SO 

1 

03 

1 

CO 

+ 

SO 

1 

03 

i 

U 

1 + J 



(cos 3 t — e f ) + jsin3f](l — j3) 


10 


10 sin 3t — 3 cos 3 1 + 3e 1 
~3 10 


= e 1 — cos 3 1 + 


1 

- sin 3 1 
o 


t > o, 


as before. 

Note: 

Clearly, in this example, the method using partial fractions is simpler. 

16.6.4 STEADY-STATE RESPONSE TO A SINGLE FREQUENCY INPUT 

In the differential equation, 


d 2 x dx 

“ de +b dt +cx 


f(t ), 


suppose that the quadratic denominator of the transfer function, U(s), has negative real 
roots; that is, it gives rise to an impulse response, u(t), involving negative powers of e and, 
hence, tending to zero as t tends to infinity. 

Suppose also that f(t) takes one of the forms, cos cut or smut, which may be regarded, 
respectively, as the real and imaginary parts of the function, e 3i0t . 

It turns out that the response, x(t), will consist of a “transient” part which tends to 
zero as t tends to infinity, together with a non-transient part forming the “steady-state 
response”. 


We illustrate with an example: 



EXAMPLE 


Consider that 


d 2 x 

dt 2 



+ 2x = e j7t , 


where x = 2 and % = 1 when t — 0. 

dt 

Taking the Laplace Transform of the differential equation, 


s{sX{s) - 2) - 1 + 3(sX(s) - 2) + 2 X(s) 


1 

s-J 7' 


That is, 


(s 2 + 3s + 2)X(s) 


2s + 7 + 

-3 7 


giving 


2s + 7 1 2s + 7 1 

s 2 + 3s + 2 + (s-j'7)(s 2 + 3s + 2) " (s + 2)(s + l) + (s - j'7)(s + 2)(s + 1) ' 


Using the “cover-up” rule for partial fractions, we obtain 


X(s) 


5 _ 3 

s 1 s 2 


1 

(-1 - J'7)(s + 1) 


1 

(2 + j7)(s + 2) + 


UU 7 ) 
(s-j7 y 


where 


U(s) 


1 

s 2 + 3s + 2 


is the transfer function. 

Taking inverse Laplace Transforms, 
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x(t) — 5e f - 3e 2t 4 — e 4 + — * — e 2< + U(j7)e jn . 

w -1 - j7 2 + j7 K ' 

The first four terms on the right-hand side tend to zero as t tends to infinity, so that the 
final term represents the steady state response; we need its real part if /(f) = cos 7 1 and its 
imaginary part if /(f) = sin7f. 

Summary 

The above example illustrates the result that the steady-state response, s(f), of a system to 
an input of e^ ut is given by 

s(t) = U{ju)e- 

16.6.5 EXERCISES 
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4. The impulse response function of a system is known to be u(t) = e 3t . 
Determine the response, x(t), of the system to an input of f(t) = 6 cos 3 1. 

5. Determine the steady-state response to the system 


in the cases when 

(a) 

(b) 


3 ¥ + * = W 


f(t) = e 


j2t. 

) 


f(t) = 3 cos 2 1. 


16.6.6 ANSWERS TO EXERCISES 


1 . 


2. (a) 

(b) 


4e~ 


x = sin2f t > 0; 


x = sin t + H(t — 2) sin (t — 2) t ^ 2. 


U(s) = 


2s 2 - 3s + 1 


and u(t) = [ e t — e 2 * 


5. (a) 


(b) 


13 L 


18e 3f — 18 cos 2 1 + 12 sin 2 1 


t > 0. 


(1 - j6)e j2t 
37 


t > 0; 


— (cos 2t + 6 sin 2t) t > 0. 

O i 
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UNIT 16.7 - LAPLACE TRANSFORMS 7 (AN APPENDIX) 

ONE VIEW OF HOW LAPLACE TRANSFORMS MIGHT HAVE ARISEN. 

(i) Let us consider that our main problem is to solve a second order linear differential 
equation with constant coefficients, the general form of which is 

d 2 a; r dx , . 

a dP +b di + cx = fit) - 


(ii) Assuming that the solution of an equivalent first order differential equation, 

da; . . 

°d t +bx = 

has already been included in previous knowledge, we examine a typical worked example as 
follows: 

EXAMPLE 


Solve the differential equation, 


given that x = 0 when t — 0. 

Solution 


da; 

dt 


+ 3a; 


= e 


2 1 


5 


A method called the “integrating factor method” uses the coefficient of x to find a 
function of t which multiplies both sides of the given differential equation to convert it to an 
“exact” differential equation. 

The integrating factor in the current example is e 3t since the coefficient of x is 3. 


We obtain, therefore, 




da; 

dt 


+ 3a; 


= e 


5 1. 


which is equivalent to 


d 
dt L' 


xe 


3 1 


= e 


5 1 


On integrating both sides with respect to t, 


0 5t 


xe 3t = V + C 


1 



or 


,2it 


Putting x = 0 and t = 0, we have 


x = — + Ce~ 3t . 
5 


o = - + c. 

5 


Hence, C = —\ and the complete solution becomes 


x = 


e 2t e -3t 

~5 5~ 


(iii) As a lead up to what follows, we shall now examine a different way of setting out the 
above working in which we do not leave the substitution of the boundary condition until the 
very end. 

We multiply both sides of the differential equation by e 3t as before, but we then integrate 
both sides of the new “exact” equation from 0 to t. 



xe 


3 1 


d t 



e 5t d t. 


That is, 
giving 

since x = 0 when t — 0. 
In other words, 


'3 1 

t 

1 

-+o 

in 

i 

xe 

0 

_ 5 _ 


5 


xe 3t - 0 


e 5t 1 

T ~ 5' 


x = 


e 2t e 3t 


5 5 ’ 


as before. 

(iv) Now let us consider whether an example of a second order linear differential equation 
could be solved by a similar method. 
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EXAMPLE 


Solve the differential equation, 


d t 2 dt 

given that x = 0 and qf = 0 when t — 0. 

Solution 


d 2 a; da; Q , 

10— + 21a; = e 9t , 


Supposing that there might be an integrating factor for this equation, we shall take it to be 
e st where s, at present, is unknown, but assumed to be positive. 

Multiplying throughout by e st and integrating from 0 to t, as in the previous example, 


ft . 

to 

2L 

pt 

g(s+9)£ 

/ e st 

— — 10— — b 21a; 

dt = / e (s+9) * dt = 


Jo 

_ dt 2 dt 

Jo 

s + 9 


Now, using integration by parts, with the boundary condition, 


rt Arp pt 

/ e st — dt = e st x - s / e st x dt 
/ o dt Jo 


and 


/'* ,+ d 2 a; 

L 6 dt 2 


, dx 
dt 


st dx 
lo 6 dt 


st " " d t = e st ^- S l e st ^d t = e st ^-se st x + s 2 e st x dt. 


.da; 

dt 


On substituting these results into the differential equation, we may collect together (on the 
left hand side) terms which involve Jq e st x dt and e st as follows: 


(s 2 + 10s + 21) £ e st x dt + e st 


da; 

dt 


— (s + 10)a; 


e (s+9)t 

s + 9 s + 9’ 


(v) OBSERVATIONS 

(a) If we had used e~ st instead of e st , the quadratic expression in s, above, would have had 
the same coefficients as the original differential equation; that is, (s 2 — 10s + 21). 

(b) Using e~ st with s > 0, if we had integrated from 0 to oo instead of 0 to t, the second 
term on the left hand side above would have been absent, since e~°° = 0. 
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(vi) Having made our observations, we start again, multiplying both sides of the differential 
equation by e~ st and integrating from 0 to oo to obtain 


roo 

(s 2 - 10s + 21) J e~ st x d t = 


— s + 9 


-1 


-s + 9 s — 9 


Of course, this works only if s > 9, but we can easily assume that it is so. Hence, 


e x dt = 


(s - 9)(s 2 - 10s + 21) (s - 9)(s - 3)(s - 7) ' 


Applying the principles of partial fractions, we obtain 


f°° -St - 1 1 11 11 

/ o 12 s — 9 24 s — 3 8 s — 7 


(vii) But, finally, it can be shown by an independent method of solution that 


g9 t e 3 t e 7 t 

x = 12 + 24 _ Y‘ 


and we may conclude that the solution of the differential equation is closely linked to the 
integral 

roo 

/ e~ st x dt, 

Jo 

which is called the “Laplace Transform” of x{t). 
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UNIT 16.8 - Z TRANSFORMS 1 - DEFINITION AND RULES 


16.8.1 INTRODUCTION - Linear Difference Equations 

Closely linked with the concept of a linear differential equation with constant coefficients is 
that of a “linear difference equation with constant coefficients” . 

Two particular types of difference equation to be discussed in the present section may be 
defined as follows: 

DEFINITION 1 

A first-order linear difference equation with constant coefficients has the general form, 


®lAn+l ~b ®(Wn f (jl) > 

where a 0 , cq are constants, n is a positive integer, f(n ) is a given function of n (possibly zero) 
and u n is the general term of an infinite sequence of numbers, {u n } = u 0 , ui, u 2r u 3 , . . . . 

DEFINITION 2 

A second-order linear difference equation with constant coefficients has the general form, 


®2^n+2 U 1 U n J t 1 T a§U n f (/l) , 


where o 0 , o i , a 2 are constants, n is an integer, f(n ) is a given function of n (possibly zero) 
and u n is the general term of an infinite sequence of numbers, {u n } = uq, u i , u 2 ,113, . . . . 

Notes: 

(i) We shall assume that the sequences under discussion are such that u n = 0 whenever 
n < 0. 

(ii) Difference equations are usually associated with given “boundary conditions” , such as the 
value of uq for a first-order equation or the values of uq and u\ for a second-order equation. 

ILLUSTRATION 

Certain simple difference equations may be solved by very elementary methods. 
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For example, suppose that we wish to solve the difference equation, 


u n +i - (n + 1 )u n = 0, 

subject to the boundary condition that Uq = 1. 

We may rewrite the difference equation as 

u n -\~\ (n T 1 )u n 

and, by using this formula repeatedly, we obtain 


u\ = uq = 1, u 2 = 2ui = 2, 113 = 3u 2 = 3x 2, u 4 = 4w 3 = 4 x3x2, . . . . 

In general, for this illustration, u n = n\. 

However, not all difference equations can be solved as easily as this and we shall now discuss 
the Z-Transform method of solving more advanced types. 

16.8.2 STANDARD DEFINITION AND RESULTS 

THE DEFINITION OF A Z-TRANSFORM (WITH EXAMPLES) 

The Z-Transform of the sequence of numbers, {«„,} = Uq, u±, u 2 , m 3 , ... ., is defined by 
the formula, 


OO 

Z{u n } = Y, u r z~ r , 

r = 0 


provided that the series converges (allowing for z to be a complex number if necessary). 

EXAMPLES 

1. Determine the Z-Transform of the sequence, 

{ u n } = {a n }, 


where a is a non- zero constant. 
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Solution 


OO 


ZK} = E 

r=0 


r 

a z 


—r 


That is, 


Z{a n } 


_ a a 2 a 3 
1 H t — ^4 — „ + 


by properties of infinite geometric series. 
Thus, 


1 


1 - 


a 

z 


Z 


z — a 


Z{a n } 


z 


z — a 


2. Determine the Z- Transform of the sequence, 


{u n } = {n}. 


Solution 


OO 


Z{n] = E 

r = 0 


rz 


That is, 


ryr 1 1 2 3 4 

z{n} ~z + 7 + 7 + 7 + ■■■■' 


which may be rearranged as 



by properties of infinite geometric series. 
Thus, 


Z[n } 


z 


z 
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Note: 

Other Z- Transforms may be obtained, in the same way as in the above examples, from the 
definition. 

We list, here, for reference, a short table of standard Z- Transforms, including those already 
proven: 

A SHORT TABLE OF Z-TRANSFORMS 


{U n } 

Z{u n } 

Region of Existence 

{1} 

Z 

z-1 

\z\ > 1 

{a n } (a constant) 

z 

z—a 

\z\ > a 

M 

(*-1)2 

l-l > 1 

je -nT j (T constant) 

z 

z—e~ T 

| z\ > e~ T 

sin nT ( T constant) 

z sinT 

* 2 -2* cosT+1 

\z\ > 1 

cos nT ( T constant) 

z(z— cos T) 
z 2 — 2 *cosT+1 

\z\ > 1 

1 for n — 0 
0 for n > 0 
(Unit pulse sequence) 

1 

All 2 

0 for n — 0 
{a n-1 } for n > 0 

1 

z—a 

\z\ > a 
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16.8.3 PROPERTIES OF Z-TRANSFORMS 


(a) Linearity 


If {u n } and {u n } are sequences of numbers, while A and B are constants, then 


Z{Au n + Bv n } = A.Z{u n } + B.Z{v n }. 


Proof: 

The left-hand side of the above identity is equivalent to 


y (Au r + Bv r )z r 

r = 0 


oo oo 

A ^2 u r z~ r + B ^2 v r z ~ r , 

r=0 r = 0 


which, in turn, is equivalent to the right-hand side. 

EXAMPLE 


Z{ 5.2 n - 3n} 


5z 

z-2 


3z 

(^i T 


(b) The First Shifting Theorem 


Z \u n — i} — .Z^u n ^, 

where {u n _i} denotes the sequence whose first term, corresponding to n = 0, is taken as 
zero and whose subsequent terms, corresponding to n = 1, 2, 3, 4, . . . ., are the terms 
Uq, ui, Us , U4, ... of the original sequence. 


Proof: 

The left-hand side of the above identity is equivalent to 


OO 

J2 U r~l Z 

r = 0 


U 0 U\ 

— + 4 + 

Z Z z 




") 
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since it is assumed that u n = 0 whenever n < 0. 
Thus, 


Z \u n ~ 1 } 


1 

z 


U 1 

Uo H h 

z 


U 2 U 3 

— H — - + 

£ 2 £ 3 


5 


which is equivalent to the right-hand side. 

Note: 

A more general form of the first shifting theorem states that 


Z \u n ~ k} — 

Z K 

where {u n -k} denotes the sequence whose first k terms, corresponding to 
n — 0, 1, 2, . ... ,k — 1, are taken as zero and whose subsequent terms, corresponding to 
n — k, k + 1, k + 2, .... are the terms u 0 , Ui, u 2 , .... of the original sequence. 

ILLUSTRATION 

Given that { u n } = {4 n }, we may say that 


Z{u n - 2 } = ~^-Z{u n } = — . 

z 1 z l z — 4 


1 

z(z- 4)' 


Note: 

In this illustration, the sequence, {u n _ 2 } has terms 0, 0, 1, 4, 4 2 , 4 3 , ... and, by applying 
the definition of a Z-Transform directly, we would obtain 

, , 1 4 4 2 4 3 

Z{u n . 2 } + - + - + 


which gives 


Z{u n } 



1 

z(z — 4) ’ 


by properties of infinite geometric series. 
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(c) The Second Shifting Theorem 


Z{u n+ 1 } = z.Z{u n } - z.u 0 


Proof: 

The left-hand side of the above identity is equivalent to 


—r | u 2 

/, u r+ \Z = U\ H 1 — - + 

Z z l 


W 4 


+ . . . . 


This may be rearranged as 


z. 


Ui 

Uq H h 

z 


u 2 


+ 



U 4 

¥ 


+ . . . 


Z.Uo 


which, in turn, is equivalent to the right-hand side. 

Note: 

This “recursive relationship” may be applied repeatedly. For example, we may deduce 
that 


Z{u n+2 } = z.Z{u n+ 1 } - z.ui = z 2 .Z{u n } - z 2 .u 0 - Z.Ui 


16.8.4 EXERCISES 

1. Determine, from first principles, the Z- Transforms of the following sequences, {u n }: 
(a) 


(b) 


{Un} = {e n }; 


{u n } = {cos 7m}. 
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2. Determine the Z-Transform of the following sequences: 


(a) 

(b) 


K} = {7.(3)"-4.(-in; 

{u n } = |6n + 2e~ 5n | ; 


{u n } = {13 + sin 2 n — cos 2n} . 


3. Determine the Z-Transform of {u n _i} and {w n _ 2 } for the sequences in question 1. 

4. Determine the Z-Transform of {u n+ 1 } and {u n+ 2 } for the sequences in question 1. 


16.8.5 ANSWERS TO EXERCISES 


1. (a) 

(b) 


2. (a) 

(b) 


ez 

ez — 1 ’ 


z 

z + 1 


7 z 4z 
z — 3 z + 1’ 


6 z 2 z 

(z — l) 2 z — e -5 ’ 


13 z 


z(sin 2 + cos 2 — z) 
z 2 — 2^ cos 2 + 1 
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z-1 



3. (a) 

Z{u n - 1} = — e — (■ n > 0), Z{u n - 2 } = -7—- — Jr (n > 1); 
ez — 1 z(ez — 1) 

(b) 

Z{u n -ij = 7^-y (n > 0), Z{u n - 2 } = Y) ( n > X )- 

Note: 

■u_2 = 0 and m_i = 0. 

4. (a) 

Z{Vl ) = ez^~-l’ Z ^ Un+2 ^ = e {ez — 1) ’ 

(b) 

= __ ^ ) ^{^11+2} = __ ^ • 
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UNIT 16.9 - Z TRANSFORMS 2 


INVERSE Z - TRANSFORMS 

16.9.1 THE USE OF PARTIAL FRACTIONS 

When solving linear difference equations by means of Z-Transforms, it is necessary to be able 
to determine a sequence, {u n }, of numbers, whose Z-Transform is a known function, F(z), 
of z. Such a sequence is called the “inverse Z-Transform of F(^)” and may be denoted 
by Z" 1 ^)]. 

For simple difference equations, the function F(z) turns out to be a rational function of 
z, and the method of partial fractions may be used to determine the corresponding inverse 
Z-Transform. 

EXAMPLES 


1. Determine the inverse Z-Transform of the function 


F(z) 


10z(z + 5) 

(z — l)(z — 2)(z + 3) ' 


Solution 

Bearing in mind that 


Z{a n } 


z 


z — a ’ 


for any non-zero constant, a, we shall write 


F(z) = z. 


10(2 + 5) 

(z- l)(z-2)(z + 3) 


which gives 


or 


F(z) = z. 


- -15 14 1 ' 

_z — l + z — 2^~z + 3_ 


F(z) = 


z + 3 


14- 


15- 


Hence, 

Z- 1 [F(^)] = {(— 3) n + 14(2)" - 15} . 
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2. Determine the Inverse Z- Transform of the function 


F(z) = 


1 


z — a 


Solution 

In this example, there is no factor, z, in the function F(z) and we shall see that it is 
necessary to make use of the first shifting theorem. 

First, we may write 


F(z) 


1 r 2 ' 

z [z — a. 


and, since the inverse Z- Transform of the expression inside the brackets is a n , the first 
shifting theorem tells us that 


z-W 


0 when n — 0; 
a n ~ l when n > 0. 


Note: 

This may now be taken as a standard result. 


3. Determine the inverse Z- Transform of the function 


m 


4 ( 2 ^ + 1 ) 

(z + l)(z-3)' 


Solution 

Expressing F(z) in partial fractions, we obtain 


Hence, 


F(z) 


1 7 

z + 1 (z — 3) 


Z -'[Fiz)] 


0 when n — 0; 

(— l) n_1 + 7.(3) n_1 when n > 0. 
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16.9.2 EXERCISES 


1. Determine the inverse Z- Transforms of each of the following functions, F(z): 


faj 

F(z) 

z 

z — l’ 

(b) 

F(z) 

z 

' - z + r 

(c) 

F(z) 

hi 

to 

1 O 

(d) 

F(z) 

z 

“3,2 + 1 

(e) 




F(Z) ~ (z-l)(z + 2Y 


(f) 

F(c) = (2z + 1)(T — 3) ’ 


(g) 

F(z) = (2z + i)( z -iy 


2. Determine the inverse Z- Transform of each of the following functions, F(z), and list the 
first five terms of the sequence obtained: 

(a) 


(b) 


F(z) = 


1 


z — 


1 ; 


F(z) 


z + 2 

z + r 


3 



F(z) ee 


z — 3 


(z-l)(z-2)' 


F(z) = 


2z 2 — 7z + 7 
(z — l) 2 (z — 2) 


16.9.3 ANSWERS TO EXERCISES 


1. (a) 


Z - 1 [F(«)] = {1} 


Z- 1 [F(z)]={(-in 




Z _1 [^)]= O 


1 / 1 


3 V 3 


Z- 1 [FW] = {i[l-(-2)”]}; 


z- 1 [FW] = E 


+*(-*)■} 
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2- (a) 


Z-'IFM] 


0 when n — 0; 

1 when n > 0; 


The first five terms are 0,1, 1,1,1 


(b) 


z-'in*)] = 


1 when n = 0; 

(— l) n_1 when n > 0. 


The first five terms are 1,1,— 1,1,— 1 


(c) 


z- 1 [eWl = {2_ (2rl 


when n — 0; 
when n > 0. 


The first five terms are 0,1,0,— 2,— 6 


(d) 


Z 


0 

3 — 2n + (2) n_1 


when n — 0 
when n > 0 


The first five terms are 0,2, 1,1,3 
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UNIT 16.10 - Z TRANSFORMS 3 

THE SOLUTION OF LINEAR DIFFERENCE EQUATIONS 

Linear difference equations may be solved by constructing the Z- Transform of both sides of 
the equation. The method will be illustrated with linear difference equations of the first and 
second orders (with constant coefficients). 

16.10.1 FIRST ORDER LINEAR DIFFERENCE QUATIONS 
EXAMPLES 

1. Solve the linear difference equation, 


u n + i - 2 u n = (3) n , 

given that uq = 2/5. 

Solution 

First of all, using the second shifting theorem, 

2 

Z{u n+ 1 } = z.Z{u n } - z.-. 

Taking the Z-Transform of the difference equation, we obtain 

2 2 
z.Z{u n j 2 Z{u n } y , 

h z - 3 


so that, on rearrangement, 


vr \ - 2 z , z 

{ “” } 5',-2 + (,_l)(,_2) 


2 z 


5 z-2 


+ z. 


T + 


z ~% z ~ 2 


3 ^ 


z-2 5 z 


i • 


Taking the inverse Z-Transform of this function of z gives the solution 

3 
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2. Solve the linear difference equation, 


Un+l + U n = f (n ) , 


given that 


f(n) = ( 1 when n = 0; 
JK J I 0 when n > 0. 

and Mo = 5. 

Solution 

First of all, using the second shifting theorem, 


Z{u n+ 1 } = z.Z{u n } - z.5 


Taking the Z-Transform of the difference equation, we obtain 


z.Z{u n } - 5z + Z{u n } = 1, 


which, on rearrangement, gives 


Z{u n } 


1 5z 

7 T- 

Z+l Z+ 1 


Hence, 


n } 


5 when n — 0 

( — l) n— 1 + 5(— l) n = 4(— l) n when n > 0 
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16.10.2 SECOND ORDER LINEAR DIFFERENCE EQUATIONS 
EXAMPLES 

1. Solve the linear difference equation 

'U>n + 1 T U j ni 

given that u o = 0 and u\ — 1. 

Solution 

First of all, using the second shifting theorem, 

Z{u n+ 1 } = z.Z{u n - z.O = z.Z{u n } 

and 

Z{u n+2 } = z 2 Z{u n } - z. 1 = z 2 Z{u n } - z. 

Taking the Z-Transform of the difference equation, we obtain 

z 2 .Z{u n } - z = z.Z{u n } + Z{u n }, 
so that, on rearrangement, 

Z i U n} = T i. 

z 2 — z — 1 

which may be written 

Z{Un\ = - —T, 

(z — a)(z — jj) 

where, from the quadratic formula, 




Using partial fractions, 

vi \ 1 r ^ ^ 

Z i u n} = a a 

a — /j [z — a z — p 

Taking the inverse Z-Transform of this function of z gives the solution 
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2. Solve the linear difference equation 


U n + 2 - 7u n+ i + 10 u n = 16 n, 

given that Uo = 6 and u i = 2. 

Solution 

First of all, using the second shifting theorem, 


Z{u n+ 1 } = z.Z{u n } - 6z 


and 


Z{u n+2 } = z 2 .Z{u n } - 6z 2 - 2 z. 
Taking the Z-Transform of the difference equation, we obtain 

z 2 .Z{u n } — 6 z 2 — 2z — 7 [ z.Z{u n } — 6 z] + 10 Z{u n } = 


which, on rearrangement, gives 


Z{u n }[z 2 -7 z + 10] - 6z 2 + 40^ = 


16 z 


(z~ l) 2 ’ 


and, hence, 


Z{u n } = 


16 z 


+ 


6 z 2 — 40 z 


(z — l) 2 (z — 5)(z — 2) (z-5)(z-2] 

Using partial fractions, we obtain 


Z{u n } = z. 


3 4 5 

/ ^ \ o + 


z — 2 z — 5 (z — l) 2 z — 1 
The solution of the difference equation is therefore 


{u n } = {4(2) n - 3(5) n + 4 n + 5} . 


16 z 

z- l) 2 ’ 
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3. Solve the linear difference equation 


U J n+2 T 2u n 0 

given that uq — 1 and u \ = y/2. 

Solution 

First of all, using the second shifting theorem, 

Z{u n+ 2 } = z 2 Z{u n } - z 2 - zV2. 
Taking the Z-Transform of the difference equation, we obtain 

z 2 Z{u n } — z 2 — zVi + 2 Z{u n } = 0, 
which, on rearrangement, gives 


z“ + z\/2 z + v^2 


ry f -) ~ 1 ~ V “ ~ ' V “ V2 

iUn] ~ ~^+2~ ~ ~ Z \z + jV2)(z-jV2) 


Using partial fractions, 

\/ 2 (l +j) 


Z{u n } = z 
so that 


+ 


V2(l -j) 


[j2V2(z - jy/2) -j2V2(z + jV 2) 




(1-J) 
[2(z-jy/2) 


+ 


{«»} = {|(1 — i)(jV2)” + 5(1 + j)(-jV 2 )"} 

= {t(x/ 2 )"[(l-i)(i)" + (l+j)(-i)”]} 


= |-(V / 2) n [V2e~ j ^.e j ^ + V2e j ^.e~ j ^ 


= \ ^ ) n+1 


^ (2n — 1 ) 7r ^.(2n — l)7r 

4 + e _J 4 


Ui(V2r\2co S < 2 u j 2k' 


s |(V2r‘cos (2,i - 1),r '.. 


(1+j) 

2(^+j v / 2)J ’ 
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16.10.3 EXERCISES 


1. Solve the following first-order linear difference equations: 

(a) 

3tln+l + 2 u n = ( — 1)", 

given that u 0 = 0; 

(b) 

u n+1 - 5 u n = 3(2)", 

given that uq = 1; 


^n+1 ~t~ 


(d) 


given that Uq 


i; 


where 


u n + i + 2it n = / (n) , 


\ = f 3 when n = 0; 
J ^ ' 1 0 when n > 0; 

and Uq = 2; 


_ . M 1 n7T 

— 3w n = sin 1 — cos — , 

n+I n 2 2 2 

given that Uq = 0. 

2. Solve the following second-order linear difference equations: 

(a) 

^n+2 2l/ n+ i + Un 0, 

given that — 0 and tii = 1; 

(b) 

u n -\-2 4n n n, 

given that w 0 = 0 and u\ — l\ 
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U n +2 8ll n -\-\ 9^n 24, 


(d) 


given that Uq 


2 and ti\ 


0; 


6M n +2 T U n 20, 


given that u 0 = 3 and u\ — 8; 


u n+ 2 + 2w n+1 — 15w n = 32 coswr, 


(f) 


given that Mq 


0 and Mi 


0; 


Wji+2 3m„ + i 4“ 3m,2 5, 


given that w 0 = 5 and «i = 8. 


16.10.4 ANSWERS TO EXERCISES 


1. (a) 


(b) 


(d) 



K) = {2(5) n - (2) n } ; 



{- u n } = 2(— 2) n + 3(— 2) n_1 when n 


{u n } 



V2cos 


(2 n — l)7r 
4 
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2. (a) 


(b) 


(c) 


(d) 


(e) 


(f) 


{/"«} = W; 




K} = {^(9)” + 3(-l)”-?}; 


W = {2 + (6) 1 -" - 5(-l)"} ; 


K} = {2(-l)" +1 + (3)“ + (-5)"} ; 


{U n } 


{ 6+ (2V3r«co S <^'.. 
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UNIT 17.1 - NUMERICAL MATHEMATICS 1 


THE APPROXIMATE SOLUTION OF ALGEBRAIC EQUATIONS 
17.1.1 INTRODUCTION 


In the work which follows, we shall consider the solution of the equation 

f(x) = 0, 

where f(x) is a given function of x. 

It is assumed that examples of such equations will have been encountered earlier at an 
elementary level; as, for instance, with quadratic equations where there is simple formula for 
obtaining solutions. 


However, the equation 


f(x) = 0 


cannot, in general, be solved algebraically to give exact solutions and we have to be satisfied, 
at most, with approximate solutions. Nevertheless, it is often possible to find approximate 
solutions which are correct to any specified degree of accuracy; and this is satisfactory for 
the applications of mathematics to science and engineering. 


It is certainly possible to consider graphical methods of solving the equation 

f(x) = 0, 

where we try to plot a graph of the equation 

V = f(x), 

then determine where the graph crosses the a;-axis. But this method can be laborious and 
inaccurate and will not be discussed, here, as a viable method. 

Three so called “iterative” methods will be included, below, where repeated use of the 
method is able to improve the accuracy of an approximate solution, already obtained. 


17.1.2 THE BISECTION METHOD 


Suppose a and b are two numbers such that /(a) < 0 and f{b) > 0. We may obtain these 
by trial and error or by sketching, roughly, the graph of the equation 

V = /O), 

in order to estimate convenient values a and b between which the graph crosses the a;-axis; 
whole numbers will usually suffice. 
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.X 


If we let c = (a + b)/ 2, there are three possibilities; 

(i) /(c) = 0, in which case we have solved the equation; 

(ii) f(c) < 0, in which case there is a solution between c and b enabling us repeat the 
procedure with these two numbers; 

(iii) f(c) > 0, in which case there is a solution between c and a enabling us to repeat the 
procedure with these two numbers. 

Each time we apply the method, we bisect the interval between the two numbers being used 
so that, eventually, the two numbers used will be very close together. The method stops 
when two consecutive values of the mid-point agree with each other to the required number 
of decimal places or significant figures. 

Convenient labels for the numbers used at each stage (or iteration) are 
Go, bo, Co, ai, b\, Ci, 02, b- 2 , C2, 03, 63, C3, ..., a n , b n , c n , .... 

EXAMPLE 

Determine, correct to three decimal places, the positive solution of the equation 

e x = x + 2. 

Solution 

We could first observe, from a rough sketch of the graphs of 

y = e x and y = x + 2, 
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that the graphs intersect each other at a positive value of x. This confirms that there is 
indeed a positive solution to our equation. 



But now let 

f(x) = e x — x — 2 

and look for two numbers between which f(x) changes sign from positive to negative. By 
trial and error, suitable numbers are 1 and 2, since 

/(l) = e - 3 < 0 and /( 2) = e 2 - 5 > 0. 


The rest of the solution may be set out in the form of a table as follows: 


n 


b n 

Cn 

/(c n ) 

0 

1.00000 

2.00000 

1.50000 

0.98169 

1 

1.00000 

1.50000 

1.25000 

0.24034 

2 

1.00000 

1.25000 

1.12500 

- 0.04478 

3 

1.12500 

1.25000 

1.18750 

0.09137 

4 

1.12500 

1.18750 

1.15625 

0.02174 

5 

1.12500 

1.15625 

1.14062 

- 0.01191 

6 

1.14063 

1.15625 

1.14844 

0.00483 

7 

1.14063 

1.14844 

1.14454 

- 0.00354 

8 

1.14454 

1.14844 

1.14649 

0.00064 

9 

1.14454 

1.14649 

1.14552 

- 0.00144 


As a general rule, it is appropriate to work to two more places of decimals than that of the 
required accuracy; and so, in this case, we work to five. 


We can stop at stage 9, since c 8 and c 9 are the same value when rounded to three places of 


decimals. The required solution is therefore x = 1.146 
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17.1.3 THE RULE OF FALSE POSITION 


This method is commonly known by its Latin name, “Regula Falsi”, and tries to compen- 
sate a little for the shortcomings of the Bisection Method. 

Instead of taking c as the average of a and b, we consider that the two points, (a, f(a )) and 
(6, /(&)), on the graph of the equation, 


V = }{x), 

are joined by a straight line; and the point at which this straight line crosses the x-axis is 
taken as c. 



From elementary co-ordinate geometry, the equation of the straight line is given by 

V - /(a) x~a 


f{b ) - f( a ) b ~ a 


Hence, when y 


That is, 


Hence, 


0, we obtain 


x = a — 


(j b-a)f(a ) 
f(b) - /(a)' 


a[f(b ) ~ f(a )] ~(b~ a)f(a ) 
f( b ) ~ /(«) 


a fjb) - bf(a) 
fib) - /(a) 
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In setting out the tabular form of a Regula Falsi solution, the c n column uses the general 
formula 


Q'nfipn) bnf (®n) 

/ (b n ) ~ f(a n ) 


EXAMPLE 


For the equation 


f(x) = x 3 + 2x — 1 = 0 


use the Regula Falsi method with ao = 0 and b 0 = 1 to determine the first approximation, 
Co, to the solution between x = 0 and x = 1. 


Solution 


We have /( 0) = —1 and /( 1) = 2, so that there is certainly a solution between x = 0 and 
x — 1. 


From the general formula, 

_ 0 x 2 - 1 x (-1) _ 1 
C ° ““ 2- (-1) ~~ 3 

and, if we were to continue with the method, we would observe that /( 1/3) < 0 so that 
Gq = 1/3 and b\ = 1. 

Note: 

The Bisection Method would have given c 0 = 

17.1.4 THE NEWTON-RAPHSON METHOD 


This method is based on the guessing of an approximate solution, x = aq, to the equation 
f(x) = 0. 

We then draw the tangent to the curve whose equation is 

V = f(x) 

at the point xq, f(x o) to find out where this tangent crosses the axaxis. The point obtained 
is normally a better approximation X\ to the solution. 
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In the diagram, 
Hence, 


,/, \ AB f{x o) 

/(I » ) = AC = — 

h = /(I ” ) 


f'(x 0 y 

so that a better approximation to the exact solution at point D is given by 

x\ = xq — h. 

Repeating the process, gives rise to the following iterative formula: 

f{x n ) 


^n+1 %r 


f'M' 


Notes: 

(i) To guess the starting approximation, x 0 , it is normally sufficient to use a similar technique 
to that in the Bisection Method; that is, we find a pair of whole numbers, a and b, such 
that /(a) < 0 and f(b) > 0; then we take x 0 = (a + b)/2. In some exercises, however, 
an alternative starting approximation may be suggested in order to speed up the rate of 
convergence to the final solution. 

(ii) There are situations where the Newton- Raphson Method fails to give a better approx- 
imation; as, for example, when the tangent to the curve has a very small gradient, and 
consequently meets the x-axis at a relatively great distance from the previous approxima- 
tion. In this Unit, we shall consider only examples in which the successive approximations 
converge rapidly to the required solution. 
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EXAMPLE 


Use the Newton- Raphson method to calculate a/5, correct to three places of decimals. 

Solution 

We are required to solve the equation 

f(x) = x 2 — 5 = 0. 


By trial and error, we find that a solution exists between x = 2 and x = 3 since /( 2) = —1 < 0 
and /( 3) = 4 > 0. Hence, we use x 0 = 2.5 


Furthermore, 
so that 


f'(x) 


^n+1 


2x, 

^n~5 

2x n 


Thus, 


1 25 

X! = 2.5 - — = 2.250, 

x 2 = 2.250 - °^ 5 ~ 2.236, 

x 3 = 2.236 - "°- 000304 ^ 2.236 
3 4.472 


At each stage, we round off the result to the required number of decimal places and use the 
rounded figure in the next iteration. 

The last two iterations give the same result to three places of decimals and this is therefore 
the required result. 
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17.1.5 EXERCISES 


1. Determine the smallest positive solution to the following equations (i) by the Bisec- 
tion Method and (ii) by the Regula Falsi Method, giving your answers correct to four 
significant figures: 

(a) 

x — 2sin 2 a: = 0; 

(b) 

e x — cos(a; 2 ) — 1 = 0. 

2. Use the Newton- Raphson Method to determine the smallest positive solution to each 
of the following equations, correct to five decimal places: 

(a) 

x 4 = 5; 

(b) 

x 3 + x 2 — 4x + 1 = 0; 

(c) 

x — 2 = In x. 


17.1.6 ANSWERS TO EXERCISES 


1. (a) 

(b) 

2. (a) 

(b) 


x — 1.849; 


x — 0.6486 


x ~ 1.49535; 


x ~ 0.27389; 


x — 3.14619 
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UNIT 17.2 - NUMERICAL MATHEMATICS 2 


APPROXIMATE INTEGRATION (A) 

17.2.1 THE TRAPEZOIDAL RULE 

The rule which is explained below is based on the formula for the area of a trapezium. If the 
parallel sides of a trapezium are of length p and q while the perpendicular distance between 
them is r, then the area A is given by 

A = r(p + q) 



Let us assume first that the curve whose equation is 

V = f{x) 

lies wholly above the x-axis between x = a and x = b. It has already been established, in 
Unit 13.1, that the definite integral 

[ f{x) dx 

J a 

can be regarded as the area between the curve y — f(x) and the x-axis from x = a to x = b. 

However, suppose we divided this area into several narrow strips of equal width, h, by 

marking the values xi,x 2 ,x 3 , , x n along the x-axis (where xq — a and x n = b ) and 

drawing in the corresponding lines of length yi, y 2 , 17 . 3 , , y n parallel to the y-axis. 
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Each narrow strip of width h may be considered approximately as a trapezium whose parallel 
sides are of lengths y t and y i+ i, where i = 1, 2, 3, ,n — 1. 

Thus, the area under the curve, and hence the value of the definite integral, approximates 
to 

h 

2 [(2/l + V 2 ) + (y-2 + Vo) + (?/3 + I/ 4 ) + + (jjn—l + 2/n)]- 

That is, 

rb ]i 

/ /(x) dx ~ -[j/i + y n + 2 ( 2/2 + 1/3 + 2/4 + + Z/n-i)]; 

-/a Z 

or, what amounts to the same thing, 



h 

2 


[First + Last + 2 x 


The Rest]. 


Note: 

Care must be taken at the beginning to ascertain whether or not the curve y = f(x) crosses 
the x-axis between x = a and x = b. If it does, then allowance must be made for the fact that 
areas below the x-axis are negative and should be calculated separately from those above 
the x-axis. 

EXAMPLE 

Use the trapezoidal rule with five divisions of the x-axis in order to evaluate, approximately, 
the definite integral: 

f 1 e x2 dx. 

Jo 


2 



Solution 


First we make up a table of values as follows: 


X 

0 

0.2 

0.4 

0.6 

0.8 

1.0 

e xZ 

1 

1.041 

1.174 

1.433 

1.896 

2.718 


Then, using h = 0.2, we have 


ri , 0 2 

/ e x dx~— [1 + 2.718 + 2(1.041 + 1.174 + 1.433 + 1.896)] ~ 1.481 
Jo 2 


17.2.2 EXERCISES 

Use the trapezoidal rule with six divisions of the x-axis to determine an approximation for 
each of the following, working to three decimal places throughout: 


1 . 


2 . 


3. 


4. 


5. 


x In x dx. 


. 7-2 5 — x 2 


dx. 



cotx dx. 


sin \/x 2 + 1 dx. 



ln(l + sinx) dx. 


17.2.3 ANSWERS TO EXERCISES 

1. 35.836 2. 0.931 3. 0.348 4. 1.468 5. 0.737 
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UNIT 17.3 - NUMERICAL MATHEMATICS 3 
APPROXIMATE INTEGRATION (B) 

17.3.1 SIMPSON’S RULE 

A better approximation to 

[ f{x) dx 

J a 

than that provided by the Trapezoidal rule (Unit 17.2) may be obtained by usingan even 
number of narrow strips of width, /?,, and considering them in pairs. 

To begin with, we examine a special case in which the first strip lies to the left of the y-axis 
as in the following diagram: 



The arc of the curve passing through the points A(— h, yi), B(0, 2 / 2 ) and C(h, y$) may be 
regarded as an arc of a parabola whose equation is 

y = Lx 2 + Mx + N, 

provided that the coefficients L , M and N satisfy the equations 


1 



yi = Lh 2 -Mh + N, 
V2 = N, 

y 3 = Lh 2 + Mh + N. 


Also, the area of the first pair of strips is given by 


rh 

Area = / (Lx 2 + Mx + N) dx 
J-h 


Q 9 

CC'~* cc^ 

L b M b Nx 

3 2 


l h 


J - h 


2 Lh 3 


2Nh 


= } l[2Lh 2 + 6N], 


which, from the simultaneous equations earlier, gives 


Area = ^ [yi + y 3 + % 2 ] • 


But the area of every pair of strips will be dependent only on 
co-ordinates, together with the value of h. 

Hence, the area of the next pair of strips will be 


[y.s + y.5 + 4y 4 ] , 


and the area of the pair after that will be 


the three corresponding y 


2 



h 

3 


[Vb + V7 + %e]- 


Thus, the total area is given by 


fl 

Area = ~[yi + y n + 4(y 2 + 2/4 + 2/e + • • •) + %3 + 2/5 + 2/7 + • ■ •)> 


usually interpreted as 


fl 

Area = — [First+Last+4xThe even numbered y co — ords.+2xThe remaining y co — ords. 
3 


or 


Area = — [F + L + 4E + 2R] 

3 

This result is known as SIMPSON’S RULE. 

Notes: 

(i) Since the area of the pairs of strips depends only on the three corresponding y co-ordinates, 
together with the value of h , the Simpson’s rule formula provides an approximate value of 
the definite integral 



f(x ) da; 


whatever the values of a and b are, as long as the curve does not cross the oxaxis between 
x = a and x = b. 

(ii) If the curve does cross the oxaxis between x = a and x — b, it is necessary to consider 
separately the positive parts of the area above the x-axis and the negative parts below the 
axaxis. 

(iii) The approximate evaluation, by Simpson’s rule, of a definite integral should be set out 
in tabular form, as illustrated in the examples overleaf. 


3 



EXAMPLES 


1. Working to a maximum of three places of decimals throughout, use Simpson’s rule with 
ten divisions to evaluate, approximately, the definite integral 


(\x. 


Solution 


Xi 

X 1 

Vi = e • 

F & L 

E 

R 

0 

1 

1 



0.1 

1.010 


1.010 


0.2 

1.041 



1.041 

0.3 

1.094 


1.094 


0.4 

1.174 



1.174 

0.5 

1.284 


1.284 


0.6 

1.433 



1.433 

0.7 

1.632 


1.632 


0.8 

1.896 



1.896 

0.9 

2.248 


2.248 


1.0 

2.718 

2.718 



F + L -> 

4E -> 

2R -> 

(F + L) + 4E + 2R -> 

3.718 

7.268 

5.544 

29.072 

x4 

x2 

11.088 

29.072 

11.088 

43.878 

////// 

////// 


Hence, 


f e x2 da; ~ ^ x 43.878 ~ 1.463 
Jo 3 

2. Working to a maximum of three places of decimals throughout, use Simpson’s rule with 
eight divisions between x — — 1 and x — 1 and four divisions between x — 1 and x = 2 
in order to evaluate, approximately, the area between the curve whose equation is 

y = (x 2 - l)e~ x 

and the axaxis from x = —1 to x = 2. 

Solution 

We note that the curve crosses the x-axis when x = — 1 and when x — 1, the y co- 
ordinates being negative in the interval between these two values of x and positive 
outside this interval. 
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Hence, we need to evaluate the negative area between x — — 1 and x = 1 and the 
positive area between x = 1 and x = 2; then we add their numerical values together to 
find the total area. 


V 


0 


x 


(a) The Negative Area 


Xi 

Hi = (x 2 - l)e~ x 

F & L 

E 

R 

-1 

0 

0 



-0.75 

-0.926 


-0.926 


-0.5 

-1.237 



-1.237 

-0.25 

-1.204 


-1.204 


0 

-1 



-1 

0.25 

-0.730 


-0.730 


0.50 

-0.455 



-0.455 

0.75 

-0.207 


-0.207 


1 

0 

0 



F + L -> 

4E -> 

2R -»• 

(F + L) + 4E + 2R -> 

0 

-2.860 

-2.692 

-11.440 

x4 

x2 

-5.384 

-11.440 

-5.384 

-16.824 

////// 

////// 
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(b) The Positive Area 


Xi 

Vi = ( x 2 - l)e~ x 

F & L 

E 

R 

1 

0 

0 



1.25 

0.161 


0.161 


1.5 

0.279 



0.279 

1.75 

0.358 


0.358 


2 

0.406 

0.406 



F + L -> 

4E -> 

2R -»• 

(F + L) + 4E + 2R -> 

0.406 

0.519 

0.279 

2.076 

x4 

x2 

0.558 

2.076 

0.558 

3.040 

////// 

////// 


The total area is thus 


0.25 

IT 


x (16.824 + 3.040) ~ 1.655 


17.3.2 EXERCISES 

Use Simpson’s rule with six divisions of the axaxis to find an approximation for each of the 
following, working to a maximum of three decimal places throughout: 


1 . 


2 . 


3. 


4. 


5 . 


x In x da;. 


"i 1 


1-2 5-x 2 


da;. 


cot x da;. 


sin V x 2 + 1 da;. 


ln(l + sin a;) da;. 
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17.3.3 ANSWERS TO EXERCISES 

1 . 35.678 2 . 0.882 3 . 0.347 4 . 1.469 5 . 0.743 
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UNIT 17.4 - NUMERICAL MATHEMATICS 4 
FURTHER GAUSSIAN ELIMINATION 


The elementary method of Gaussian Elimination, for simultaneous linear equations, was 
discussed in Unit 9.4. We introduce, here, a more general method, suitable for use with 
sets of equations having decimal coefficients. 

17.4.1 GAUSSIAN ELIMINATION BY “PARTIAL PIVOTING” WITH A CHECK 
COLUMN 

Let us first consider an example in which the coefficients are integers. 

EXAMPLE 

Solve the simultaneous linear equations 


2 x + y + z = 3, 
x — 2 y — z = 2, 

3 x — y + z = 8. 

Solution 

We may set out the solution, in the form of a table (rather than a matrix) indicating each 
of the “pivot elements” in a box as follows: 

E 
0 

11 

=7_ 

2 

1 

_2 

4 

INSTRUCTIONS 

(i) Divide the coefficients of x in lines 2 and 3 by the coefficient of x in line 1 and write the 
respective results at the side of lines 2 and 3; (that is, \ and | in this case). 

(ii) Eliminate x by subtracting \ times line 1 from line 2 and f times line 1 from line 3. 


x 


I 1 

JLa 


-2 

-1 


^5 

2 


^3 
2 

-1 

2 2 


constant 

3 

2 


1 



(iii) Repeat the process starting with lines 4 and 5. 

(iv) line 6 implies that z — 3 and by substitution back into earlier lines, we obtain the values 
y = — 2 and x — 1. 

OBSERVATIONS 

Difficulties could arise if the pivot element were very small compared with the other quantities 
in the same column, since the errors involved in dividing by small numbers are likely to be 
large. 

A better choice of pivot element would be the one with the largest numerical value in its 
column. 

We shall consider an example, now, in which this choice of pivot is made. The working will 
be carried out using fractional quantities; though, in practice, decimals would normally be 
used instead. 

EXAMPLE 

Solve the simultaneous linear equations 


x — y + 2z = 5, 
2 x + y — z = 1, 
x + 3y — z = 4. 


Solution 



X 

y z 

constant 

E 

1 

2 

1 

-1 2 

5 

7 



2 

1 - 1 

1 

3 

1 

2 

1 

3 -1 

4 

7 


On eliminating x, we obtain the new table: 



y 

Z 

constant 

E 

— 3 

—3 

5 

9 

n 

5 

2 

2 

2 

2 



5 

-1 

7 

11 



2 

2 

2 

2 


Eliminating y takes us to the final table as follows: 


2 



z 

constant 

E 

11 

33 

44 

5 

5 

5 


We conclude that 
llz = 33 and, hence, 



Substituting into the second table (either line will do), we have 


5y — 3 = 7 and, hence, y — 2 


Substituting into the original table (any line will do), we have 


x — 2 + 6 = 5, so that \ x = 1 


Notes: 


(i) In questions which involve decimal quantities stated to n decimal places, the calculations 
should be carried out to n + 2 decimal places to allow for rounding up. 


(ii) A final check on accuracy in the above example is obtained by adding the original three 
equations together and verifying that the solution obtained also satisfies the further equation 

4x + 3y = 10. 


(iii) It is not essential to set out the solution in the form of separate tables (at each step) 
with their own headings. A continuation of the first table is acceptable. 

17.4.2 EXERCISES 


1. Use Gaussian Elimination by Partial Pivoting with a check column to solve the following 
sets of simultaneous linear equations: 


(a) 

X\ + 2x 2 + 3x 3 = 5, 
3xi - x 2 + 2 x 3 = 8, 
4xi — 6 x 2 — 4^3 = —2. 

(b) 

5*i — *2 + 2i 3 = 3, 
2ii + 4i 2 + * 3 = 8, 
i\ + 3*2 — 3*3 = 2; 


3 




-4, 

33, 

3; 


*i + 2i 2 + 3*3 — 

2*i + 6*2 - 3*3 = 

4*1 — 2*2 + *3 = 

(d) 

7*i - 4 * 2 = 12, 

-4*i + 12*2 - 6*3 = 0, 

-6*2 + 14*3 = 0; 

2. Use Gaussian Elimination with Partial Pivoting and a check column to solve the fol- 
lowing sets of simultaneous linear equations: 

(a) 

1.202xi-4.371x 2 + 0.651x 3 = 19.447, 

-3.141x1 + 2.243x2 - 1.626x3 = -13.702, 

0.268xi — 0.876x 2 + 1.341x 3 = 6.849; 

(b) 

-2.381x1 + 1.652x2 - 1.243x3 = 12.337, 

2.151xi- 3.427x 2 + 3.519x 3 = 9.212, 

1.882x1 + 2.734x2 - 1.114x3 = 5.735; 


17.4.3 ANSWERS TO EXERCISES 

1. (a) xi = -1, x 2 = -3, x 3 = 4; 

(b) *i = 0.5, * 2 = 1.5, * 3 = 1.0; 

(c) *i = 3.0, * 2 = 2.5, * 3 = -4.0; 

(d) *i = 2.26, * 2 = 0.96, * 3 = 0.41 

2. (a) xi = 0.229, x 2 = —4.024, X3 = 2.433; 
(b) xi = -5.753, x 2 = 14.187, x 3 = 19.951 
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UNIT 17.5 - NUMERICAL MATHEMATICS 5 


ITERATIVE METHODS FOR SOLVING SIMULTANEOUS LINEAR 
EQUATIONS 

17.5.1 INTRODUCTION 

An iterative method is one which is used repeatedly until the results obtained acquire a 
pre-assigned degree of accuracy. For example, if results are required to be accurate to five 
places of decimals, the number of “iterations” (that is, stages of the method) is continued 
until two consecutive iterations give the same result when rounded off to that number of 
decimal places. It is usually enough for the calculations themselves to be carried out to two 
extra places of decimals. 

A similar interpretation holds for accuracy which requires a certain number of significant 
figures. 

In the work which follows, we shall discuss two standard methods of solving a set of simul- 
taneous linear equations of the form 

a\X + biy + C\Z = ki, 
a 2 x + b 2 y + c 2 z = k 2 , 
a 3 x + h V + c z z = k 3 , 


when the system is “diagonally dominant”, which, in this case, means that 


M 

> 

\b±\ + ci . 

N 

> 

K + c 2 

N 

> 

KI + KI 


The methods would be adaptable to a different number of simultaneous equations. 
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17.5.2 THE GAUSS-JACOBI ITERATION 


This method begins by making x the subject of the first equation, y the subject of the second 
equation and z the subject of the third equation. 

An initial approximation such as x 0 = 1, y 0 — 1, z 0 — 1 is substituted on the new right-hand 
sides to give values x — Xi, y — y\ and z — Z\ on the new left-hand sides. 

A continuation of the process leads to the following general scheme for the results of the 
(n + l)-th iteration: 


^n+1 


Un -\- 1 


%n -\- 1 


— (A:i 

a i 



C 3 


- hy n - Ciz n ) , 
%n ^2 Z n ) 7 

•En ^3 Un ) • 


This scheme will now be illustrated by numerical examples: 

EXAMPLES 

1. Use the Gauss-Jacobi method to solve the simultaneous linear equations 


5 x + y — z = 4, 
x + Ay + 2z = 15, 
x — 2y + 5z = 12, 


obtaining x, y and z correct to the nearest whole number. 

Solution 

We have 


— 0.8 — 0.2 y n + 0.2 z n , 

= 3.75 - 0.25x n - 0.5 z n , 

= 2.4 - 0.2a; n + 0Ay n . 


%n + 1 
2/n+l 
Zn -\- 1 
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Using 

x 0 = l,j/o = l,2o = 1, 

we obtain 

X\ = 0.8, 2/1 = 3.0, z\ = 2.6, 
x 2 = 0.72, y 2 = 2.25, z 2 = 3.44, 
a; 3 = 1.038,|/3 = 1.85,2s = 3.156 
The results of the last two iterations both give 

X = l,y = 2,2 = 3, 

when rounded to the nearest whole number. 

In fact, these whole numbers are clearly seen to be the exact solutions. 

2. Use the Gauss-Jacobi method to solve the simultaneous linear equations 

x + 7y — z = 3, 

5 x + y + z = 9, 

-3x + 2y + 7z = 17, 

obtaining x, y and 2 correct to the nearest whole number. 

Solution 

This set of equations is not diagonally dominant; but they can be rewritten as 

7 y + x — z = 3, 
y + 5x + z = 9, 

2y — 3x + 7z = 17, 

which is a diagonally dominant set. We could also interchange the first two of the 
original equations. 

We have now 

y n + 1 = 0.43 - 0.14x n + 0.14 z n , 

x n+ i = 1.8 - 0.2 y n - 0.2 z n , 

z n+ 1 = 2.43 + 0A3x n — 0.29 y n . 

3 



Using 


Vo = l,x 0 = Mo = 1, 


we obtain 

yi = 0.43, x\ = 1.4, z\ = 2.57, 
y 2 = 0.59, £2 = 1.2, z 2 = 2.91, 

?/3 = 0.67, x 3 = 1.1, z 3 = 2.78 

This is now enough to conclude that x — l,y — 1, z — 3 to the nearest whole number 
though, this time, they are not the exact solutions. 

17.5.3 THE GAUSS-SEIDEL ITERATION 

This method differs from the Gauss-Jacobi Iteration in that successive approximations are 
used within each step as soon as they become available. 

It turns out that the rate of convergence of this method is usually faster than that of the 
Gauss-Jacobi method. 

The scheme of the calculations is according to the following pattern: 


V n ■ 1 
Dn+1 
Zn+1 


— {kl - biy n - dZn) , 

a i 

7 (^2 ®2-2'n+l C-2 Z n ^ , 

b 2 

(^3 ®3*^r) + l b 3 y n+ i ) . 

C 3 


EXAMPLES 

1. Use the Gauss-Seidel method to solve the simultaneous linear equations 


5 x + y — z = 4, 
x + 4y + 2z = 15, 
x — 2y + 5z = 12. 
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Solution 

This time, we write: 


x n+ i = 0.8 - 0.2 y n + 0.2z n , 

Vn+ 1 = 3.75 - 0.25a; n+ i - 0.5z n , 

Zn + 1 2.4 0.2x n+ i T 0.4?/ n _|_i, 


and the sequence of successive results is as follows: 

x 0 = l,j/o = Mo = 1, 

X] = 0.8, 2/1 = 3.05, 2:1 = 3.46, 
x 2 = 0.88,|/2 = 1.80, = 2.94, 

x 3 = 1.03, y 3 = 2.02, z 3 = 3.00 

In this particular example, the rate of convergence is about the same as for the Gauss- 
Jacobi method, giving x — 1, y — 2, z — 3 to the nearest whole number; but we would 
normally expect the Gauss-Seidel method to converge at a faster rate. 

2. Use the Gauss Seidel method to solve the simultaneous linear equations: 


7 y + x — z = 3, 
y + 5x + z = 9, 

2y — 3x + 7z = 17. 

Solution 

These equations give rise to the following iterative scheme: 

y n + i = 0.43 — 0.14x n +0.14 z n , 

x n -\~i 1.8 0.2 y n -\-\ 0.2z n , 

z n+ 1 = 2.43 + 0.43x n+ i - 0.29y n+ i, 
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The sequence of successive results is: 

Vo = Mo = Mo = 1, 
yi = 0.43, x\ = 1.51, z\ = 2.96, 
y 2 = 0.63, x 2 = 1.08, 32 = 2.71, 
y 3 = 0.66, x 3 = 1.13, z 3 = 2.73 

Once more, to the nearest whole number, the solutions are x — 1, y — 1, z — 3. 

17.5.4 EXERCISES 

1. Setting xq = y 0 = z 0 = 1 and working to three places of decimals, complete four 
iterations of 

(a) The Gauss-Jacobi method 
and 

(b) The Gauss-Seidel method 

for the system of simultaneous linear equations 

7x — y + z = 7.3, 

2x — 8y — z = —6.4, 
x + 2y + 9z = 13.6 

To how many decimal places are your results accurate ? 

2. Rearrange the following equations to form a diagonally dominant system and perform 
the first four iterations of the Gauss-Seidel method, setting x 0 — 1, y 0 — 1 and z 0 — 1 
and working to two places of decimals: 

x + 5y — z = 8, 

—9a; + 3y + 2z = 3, 
x + 2y + 7z = 26. 

Estimate the accuracy of your results and suggest the exact solutions (checking that 
they are valid). 
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3. Use an appropriate number of iterations of the Gauss-Seidel method to solve accurately, 
to three places of decimals, the simultaneous linear equations 


7x + y + z = 5, 
— 2x + 9y + 3z = 4, 
x + 4y + 8z = 3. 


17.5.5 ANSWERS TO EXERCISES 

1. (a) X4 ~ 1.014, 1/4 ~ 0.850, Z4 ~ 1.199, which are accurate to one decimal place. 

(b) X4 ~ 0.999, j / 4 ~ 0.900, Z4 ~ 1.200, which are accurate to two decimal places. 

2. On interchanging the first two equations, aq ~ 0.98, y 4 ~ 2.00, Z4 ~ 2.99, which are 
accurate to the nearest whole number. The exact solutions are x — 1, y — 2, z — 3. 

3. x ~ 0.630, y ~ 0.582, z — 0.004. 
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UNIT 17.6 - NUMERICAL MATHEMATICS 6 


NUMERICAL SOLUTION OF ORDINARY DIFFERENTIAL 
EQUATIONS (A) 

17.6.1 EULER’S UNMODIFIED METHOD 


Every first order ordinary differential equation can be written in the form 


d y_ 

da; 




and, if it is given that y = yo when x = Xq, then the solution for y in terms of x represents 
some curve through the point P 0 (a;o,|/o)- 


Suppose that we wish to find the solution for y at x = d, where d > x 0 . 


We sub-divide the interval from x = xq to x = d into n equal parts of width, 5x. 


V 


0 



X 


Xq d 


Letting xi,x 2 ,x 3 , ... be the points of subdivision, we have 

X\ = Xq + 5x, 

x 2 = x 0 + 2 <5x, 

X 3 = Xq + 3 5x, 

•••? 

• • •? 

d — x n — x o + ndx. 

If y x , ;(/ 2 , y. 3 , ••• are the y co-ordinates of xi, x 2 , x%, ..., we are required to find y n . 
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From elementary calculus, the increase in y, when x increases by 5x, is given approximately 
by and since, in our case, 4^ = f(x,y ), we have 

2/i = 2/o + f(x 0 ,y 0 )Sx , 

1/2 = 2/i + /(x^y^Sx, 

2/3 = 2/2 + f(x 2 ,y 2 )Sx, 

* * *? 

2 In Un—l "h /(X n _i,2/ n -l)^, 

each stage using the previously calculated y value. 

Note: 

The method will be the same if d < Xo, except that da; will be negative. 

In general, each intermediate value of y is given by the formula 


2/m = 2/i + f(xi,yi)$x- 


EXAMPLE 


Use Euler’s method with 5 sub- intervals to continue, to x 
equation, 

dy 


given that y 


1 when x 


da; 


= xy, 


0; (that is, y(0) = 1). 


0.5, the solution of the differential 


Solution 


i 

Xi 

Vi 

f(xi,Vi) 

2/m = 2 h + f{xi,yi)Sx 

0 

0 

1 

0 

1 

1 

0.1 

1 

0.1 

1.01 

2 

0.2 

1.01 

0.202 

1.0302 

3 

0.3 

1.0302 

0.30906 

1.061106 

4 

0.4 

1.061106 

0.4244424 

1.1035524 

5 

0.5 

1.1035524 

- 

- 


Accuracy 

The differential equation in the above example is simple to solve by an elementary method, 


2 




such as separation of the variables. It is therefore useful to compare the exact result so 
obtained with the approximation which comes from Eulers’ method. 


Therefore 


that is, 


At x = 0, we are told that y 


I 


d y 
y 


In y 


■ / 

x 2 

T 


xdx. 

+ c-, 


y = Ae 2 . 

1 and, hence, A — 1, giving 

si 

y = e 2 . 


Bnt a table of values of x against y in the previous interval reveals the following: 


X 

e 2 

0 

1 

0.1 

1.00501 

0.2 

1.0202 

0.3 

1.04603 

0.4 

1.08329 

0.5 

1.13315 


There is thus an error in our approximate value of 0.0296, which is about 2.6%. Attempts 
to determine y for values of x which are greater than 0.5 would result in a very rapid growth 
of error. 

17.6.2 EULER’S MODIFIED METHOD 

In the previous method, we used the gradient to the solution curve at the point Po in order 
to find an approximate position for the point Pi, and so on up to P n . 

But the approximation turns out to be much better if, instead, we use the average of the 
two gradients at P 0 and Pi for which we need use only x 0 , y 0 and Sx in order to calculate 
approximately. 
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V 


Po 

e L 

Xq 


Pi 

K X 

X\ 


The gradient, m 0 , at P 0 , is given by 

m 0 = f(x 0 ,y 0 ). 

The gradient, m i, at Pi, is given approximately by 

mi = f(x o + Sx,y 0 + 5y 0 ), 

where Sy 0 = f(x 0 ,y 0 )Sx. 

Note: 

We cannot call y 0 + 5y 0 by the name y \ , as we did with the unmodified method, because this 
label is now reserved for the new and better approximation at x = Xq + Sx. 

The average gradient, between Po and Pi, is given by 

ml = ^(m 0 + mi). 

Hence, our approximation to y at the point Pi is given by 

U\ = yo + m* 0 Sx. 

Similarly, we proceed from iji to y 2 , and so on until we reach y n . 

In general, the intermediate values of y are given by 

Vi + 1 = Vi + m*Sx. 
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EXAMPLE 


Solve the example in the previous section using Euler’s Modified method. 


Solution 


i 

Xi 

Vi 

mi = 

f(xi,Vi) 

% = 
f{xi,yi)5x 

m i+ 1 = 

f(xi +5x,yi +6yi) 

m* = 

| (mt + m i+ 1 ) 

Vi + 1 = 
yi + m* 5x 

0 

0 

1 

0 

0 

0.1 

0.05 

1.005 

1 

0.1 

1.005 

0.1005 

0.0101 

0.2030 

0.1518 

1.0202 

2 

0.2 

1.0202 

0.2040 

0.0204 

0.3122 

0.2581 

1.0460 

3 

0.3 

1.0460 

0.3138 

0.0314 

0.4310 

0.3724 

1.0832 

4 

0.4 

1.0832 

0.4333 

0.0433 

0.5633 

0.4983 

1.1330 

5 

0.5 

1.1330 







17.6.3 EXERCISES 


1 . 


(a) Taking intervals 5x 
that 


0.2, use Euler’s unmodified method to determine y(l), given 


d y 

da; 


+ y 


0, 


and that y(0) — 1 

Compare your solution with the exact solution given by 


y 


= e 


— X 


(b) Taking intervals 8x 
that 


0.1, use Euler’s unmodified method to determine y(l), given 

d y x 2 + y 

dx x 


and that y( 0.5) = 0.5. 

Compare your solution with the exact solution given by 

9 x 
y = x + -. 


(c) Taking intervals 5x 
that 


0.2, use Euler’s unmodified method to determine y(l), given 


d y 

da; 


y + e 


and that y( 0) = 0. 

Compare your solution with the exact solution given by 


y = sinh x. 
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(d) Given that y( 1) = 2, use Euler’s unmodified method to continue the solution of the 
differential equation, 

d y _ 2 . y 

dx~ X + 2' 

to obtain values of y for values of x from x — 1 to x — 1.5, in steps of 0.1. 

2. Repeat all parts of question 1 using Euler’s modified method. 


17.6.4 ANSWERS TO EXERCISES 


(a) 

0.33, 

0.37, 

11% low; 

(b) 

1.45 

1.50. 

3% low; 

(c) 

1.113, 

1.175, 

5% low; 

(d) 

2.0000, 

2.200, 

2.431 2.697 3.001, 3.347. 

(a) 

0.371, 

0.368, 

0.8% high; 

(b) 

1.495, 

1.50, 

0.33% low; 

(c) 

1.175 , 

accurate to three decimal places; 

(d) 

2.000, 

2.216, 

2.465, 2.751, 3.079, 3.452. 


Note: 

In questions 1(d) and 2(d), the actual values are 2.000, 2.245, 2.496, 2.784, 3.113 and 3.489, 
from the exact solution of the differential equation. 


6 



“JUST THE MATHS” 


UNIT NUMBER 
17.7 

NUMERICAL MATHEMATICS 7 
(Numerical solution) 
of 

(ordinary differential equations (B)) 

by 

A. J. Hobson 


17.7.1 Picard’s method 

17.7.2 Exercises 

17.7.3 Answers to exercises 




UNIT 17.7 - NUMERICAL MATHEMATICS 7 

NUMERICAL SOLUTION OF ORDINARY DIFFERENTIAL 
EQUATIONS (B) 

17.7.1 PICARD’S METHOD 


This method of solving a differential equation approximately is one of successive approxi- 
mation; that is, it is an iterative method in which the numerical results become more and 
more accurate, the more times it is used. 


An approximate value of y (taken, at first, to be a constant) is substituted into the right 
hand side of the differential equation 


dy 

da; 


f(x,y). 


The equation is then integrated with respect to x giving y in terms of a; as a second approx- 
imation, into which given numerical values are substituted and the result rounded off to an 
assigned number of decimal places or significant figures. 

The iterative process is continued until two consecutive numerical solutions are the same 
when rounded off to the required number of decimal places. 


A hint on notation 

Imagine, for example, that we wished to solve the differential equation 


dy 

da; 


= 3a; 2 , 


given that y = y 0 = 7 when x = x 0 = 2. 


This ofcourse can be solved exactly to give 


y = x 3 + C, 


which requires that 


7 = 2 3 + C. 


Hence, 


y - 7 = x 3 - 2 


3 . 


or, in more general terms 


y-Vo = x 3 - a;jj. 
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Thus, 


ry r x 

/ d y — 3a; 2 da; 

Jyo Jx 0 


In other words, 


f X ^- dx= [ X 3x 2 dx. 

Jx o O-X Jx o 

The rule, in future, therefore, will be to integrate both sides of the given differential equation 
with respect to x, from x 0 to x. 

EXAMPLES 


1. Given that 


and that y — 0 when x 
places of decimals. 

Solution 


d v . 2 

-T- = x + y , 
da; 

0, determine the value of y when x 


To begin the solution, we proceed as follows: 


0.3, correct to four 


where Xq = 0. 
Hence, 

where yo = 0. 
That is, 





(x + y 2 ) da;, 


2/ — 2/o 



(x + y 2 ) da;, 


V = 



(x + y 2 ) dx. 


(a) First Iteration 

We do not know y in terms of x yet, so we replace y by the constant value yo in the 
function to be integrated. 

The result of the first iteration is thus given, at x — 0.3, by 


rx 

iji — I xdx 
J o 


2 

— ~ 0.0450 
2 


2 



(b) Second Iteration 

Now we use 

dy 


da; 

Therefore, 

r x dy 


Jo dx 

which gives 



X 

~4 


da; = 


r x / x 4 
x + — 


da;, 


9 ^ 

X X 

y ~ ~ ~ 2 ~ + 20 ' 

The result of the second iteration is thus given by 

9 ^ 

CC^ 1 Xj 

U2 = 1 — 0.0451 

y 2 20 

at x = 0.3. 


(c) Third Iteration 

Now we use 


dy , 2 

d^ = :c+!/2 

4 7 10 

OC'* 0C ' 0C w 

= I+ T + 20 + 400- 


Therefore, 


which gives 


r = 

/o da; 


a ; 4 a ; 7 a ; 10 \ 

g + T + S + 400 AX ’ 


y — 0 — — — T T T ^ • 

y 2 20 160 4400 


The result of the third iteration is thus given by 

~ 0.0451 at x = 0.3 


V3 2 + 20 + 160 + 4400 


Hence, y = 0.0451, correct to four decimal places, at x = 0.3. 


2. If 


dy = 2 _ V 

da; x 

and y — 2 when x — 1, perform three iterations of Picard’s method to estimate a value 
for y when x = 1.2. Work to four places of decimals throughout and state how accurate 
is the result of the third iteration. 
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Solution 

(a) First Iteration 


where Xq = 1. 
That is, 

where y 0 = 2. 
Hence, 


Replacing y by y 0 







(hr, 


y-yo 




(hr, 


v ~ 2 = l ‘ ( 2 -f) dI - 

2 in the function being integrated, we have 


Therefore, 


y = 2 + [2a; — 2 In a;]* 

= 2 + 2a; — 2 In x — 2 + 2 In 1 = 2(x — In x) . 


The result of the first iteration is thus given by 

yi = 2(x — ln.r) ~ 2.0354, 


when x = 1.2. 

(b) Second Iteration 

In this case we use 

dy ^ 2/i g 2(x — lnx) 2 In x 

dx x xx 


Hence, 


That is, 


f x dy . f x 2 Inx , 

/ —da; = / da;. 

'i da; J i x 


y — 2 = (In a;) 2 = (In a;) 5 


The result of the second iteration is thus given by 


y 2 = 2+ (In a;) 2 ~ 2.0332, 
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when x = 1.2. 


(c) Third Iteration 

Finally, we use 


Hence, 


That is, 


d y 

da; 



(In a;) 2 
x 




( Inx ) 2 
x 


da;. 


V - 2 


2 a; — 2 In a; — 


(In a;) 3 


= 2a; — 2 In x — 


(In a;) 3 
3 


- 2 . 


The result of the third iteration is thus given by 

, (In a;) 3 

2/3 = 2a; — 2 In x — ~ 2.0293, 

o 

when x = 1.2. 

The results of the last two iterations are identical when rounded off to two places of 
decimals, namely 2.03. Hence, the accuracy of the third iteration is two decimal place 
accuracy. 


17.7.2 EXERCISES 


1. Use Picard’s method to solve the differential equation 

dy T 

i L = y + e x 

ax 

at x = 1, correct to two significant figures, given that y = 0 when x — 0. 

2. Use Picard’s method to solve the differential equation 

d y 2 . V 
Tx =X + 2 

at x = 0.5, correct to two decimal places, given that y = 1 when x — 0. 
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3. Given the differential equation 


dy 

dx = 


xy, 


where y( 0) = 0, use Picard’s method to obtain y as a series of powers of x which will 
give two decimal place accuracy in the interval 0 < x < 1. 

What is the solution when x = 11 


17.7.3 ANSWERS TO EXERCISES 


1 . 


2 . 


2/(1) — 2.7 
7/(0. 5) ~ 1.33 


3. 


x 3 

y + 15 _ 105 + 945 


x 


X 


x° 


7 /( 1 ) ~ 0.72 
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UNIT 17.8 - NUMERICAL MATHEMATICS 8 

NUMERICAL SOLUTION OF ORDINARY DIFFERENTIAL 
EQUATIONS (C) 

17.8.1 RUNGE’S METHOD 

We solve, approximately, the differential equation 

y = f(x,y), 

ax 

subject to the condition that y — yo when x = xq. 

Consider the graph of the solution, passing through the two points, A(xo,yo) and 
BfYo + h,y 0 + k). 



We can say that 


That is, 



Suppose that C is the intersection with the curve of the perpendicular bisector of MN. Then, 
by Simpson’s Rule (See Unit 17.3), 

rx 0 + h h/2 

/ f(x,y)dx = -L- [/(A) + f(B) + 4f(C)] . 

J X 0 O 
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(i) The value of /(A) 

This is already given, namely, f(xo,yo). 

(ii) The Value of /(B) 



In the diagram, if the tangent at A meets BqB in Bi, then the gradient at A is given by 


BiB 0 

AB^ 


f(x 0 ,y 0 ). 


Therefore, 


BiB 0 = AB 0 f(xo,y 0 ) = hf(x 0 ,y 0 ). 
Calling this value hi, as an initial approximation to /c, we have 


k\ = hf(x 0 ,y 0 ). 


As a rough approximation to the gradient of the solution curve passing through B, we now 
take the gradient of the solution curve passing through B^ Its value is 

f(x o + h,y 0 + ki). 


To find a better approximation, we assume that a straight line of gradient f(x o + h,y 0 + k±), 
drawn at A, meets B 0 B in B 2 , a point nearer to B than Bi. 

Letting B 0 B 2 = k 2 , we have 

k 2 = hf(x o + h,y 0 + h). 
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The co-ordinates of B 2 are (xo + h, yo + k 2 ) and the gradient of the solution curve through B 2 
is taken as a closer approximation than before to the gradient of the solution curve through 
B. Its value is 

/(To + h, yo + k 2 ). 

(iii) The Value of /(C) 
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PRACTICAL LAYOUT 

If 


and y = y o when x = x 0 , then the value of y when x = x 0 + h is determined by the following 
sequence of calculations: 

1. ki = hf(x 0 ,y 0 ). 

2. k 2 = hf(x o + h,y 0 + h). 

3. k :i = hf(x o + h,y 0 + k 2 ). 

4. h'4 = hf + 2 1 yo + • 

5. k = \(k\ + k 3 + 4/c 4 ). 

6. y ~ y 0 + k. 

EXAMPLE 


Solve the differential equation 

dy 

da; 

at x = 0.1, given that y = 1 when x = 0. 

Solution 


5-3 y 


We use a; 0 = 0, y 0 — 1 and h = 0.1. 

1. ki = 0.1(5 -3) = 0.2 

2. k 2 = 0.1(5 -3[1.2]) = 0.14 

3. k 3 = 0.1(5 -3[1. 14]) = 0.158 

4. k 4 = 0.1(5 -3[1.1]) = 0.17 

5. k = |(0.2 + 0.158 + 4[0.17]) = 0.173 

6 . y — 1.173 at x = 0.1 

Note: 

It can be shown that the error in the result is of the order h 5 ; that is, the error is equivalent 
to some constant multiplied by h 5 . 

17.8.2 EXERCISES 


1. Use Runge’s Method to solve the differential equation 


at x = 0.3, given that y 


d y 

dx 

0 when x = 0. 


x + y 2 
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Work to four places of decimals throughout. 


2. Use Runge’s Method to solve the differential equation 

d y = y 

da; y + x 

at x = 1.1, given that 7/(1) = 1. 

Work to three places of decimals throughout. 


3. Use Runge’s Method with successive increments of h = 0.1 to find the solution at 
x = 0.5 of the differential equation 


d y 

da; 


= xy, 


given that y(0) = 1. Work to four decimal places throughout. 
Compare your results with those given by the exact solution 


y = e 2 


4. 


Use Runge’s Method with h = 0.2 
equation, 


given that y( 0) = 0. Work to four 


to determine the solution at x 


dy _ T 
-f- = y + e ", 
da; 

decimal places throughout. 


1 of the differential 


17.8.3 ANSWERS TO EXERCISES 


1 . 


2 . 

3. 

4. 


7 /( 0 . 3 ) ~ 0.0454 
7 /( 1 . 1) ~ 1.049 
7 /( 0 . 5) ~ 1.3318 


7 /( 1 ) ~ 1.1752 
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UNIT 18.1 - STATISTICS 1 - THE PRESENTATION OF DATA 

18.1.1 INTRODUCTION 

(i) The collection of numerical information often leads to large masses of data which, if they 
are to be understood, or presented effectively, must be summarised and analysed in some 
way. This is the purpose of the subject of “Statistics” . 

(ii) The source from which a set of data is collected is called a “population”. For example, 
a population of 1000 ball-bearings could provide data relating to their diameters. 

(iii) Statistical problems may be either “descriptive problems” (in which all the data is 
known and can be analysed) or “inference problems” (in which data collected from a 
“sample” population is used to infer properties of a larger population). For example, the 
annual pattern of rainfall over several years in a particular place could be used to estimate 
the rainfall pattern in other years. 

(iv) The variables measured in a statistical problem may be either “discrete” (in which 
case they may take only certain values) or “continuous” (in which case they make take 
any values within the limits of the problem itself. For example, the number of students 
passing an examination from a particular class of students is a discrete variable; but the 
diameter of ball-bearings from a stock of 1000 is a continuous variable. 

(v) Various methods are seen in the commercial presentation of data but, in this series of 
Units, we shall be concerned with just two methods, one of which is tabular and the other 
graphical. 

18.1.2 THE TABULATION OF DATA 
(a) Ungrouped Data 

Suppose we have a collection of measurements given by numbers. Some may occur only 
once, while others may be repeated several times. 

If we write down the numbers as they appear, the processing of them is likely to be cum- 
bersome. This is known as “ungrouped (or raw) data” , as, for example, in the following 
table which shows rainfall figures (in inches), for a certain location, in specified months over 
a 90 year period: 
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TABLE 1 - Ungrouped (or Raw ) Data 


18.6 

13.8 

10.4 

15.0 

16.0 

22.1 

16.2 

36.1 

11.6 

7.8 

22.6 

17.9 

25.3 

32.8 

16.6 

13.6 

8.5 

23.7 

14.2 

22.9 

17.7 

26.3 

9.2 

24.9 

17.9 

26.5 

26.6 

16.5 

18.1 

24.8 

16.6 

32.3 

14.0 

11.6 

20.0 

33.8 

15.8 

15.2 

24.0 

16.4 

24.1 

23.2 

17.3 

10.5 

15.0 

20.2 

20.2 

17.3 

16.6 

16.9 

22.0 

23.9 

24.0 

12.2 

21.8 

12.2 

22.0 

9.6 

8.0 

20.4 

17.2 

18.3 

13.0 

10.6 

17.2 

8.9 

16.8 

14.2 

15.7 

8.0 

17.7 

16.1 

17.8 

11.6 

10.4 

13.6 

8.4 

12.6 

8.1 

11.6 

21.1 

20.5 

19.8 

24.8 

9.7 

25.1 

31.8 

24.9 

20.0 

17.6 


( b ) Ranked Data 

A slightly more convenient method of tabulating a collection of data would be to arrange 
them in rank order, so making it easier to see how many times each number appears. This 
is known as “ranked data ”. 


The next table shows the previous rainfall figures in this form 


TABLE 2 - Ranked Data 


7.8 

8.0 

8.0 

8.1 

8.4 

8.5 

8.9 

9.2 

9.6 

9.7 

10.4 

10.4 

10.5 

10.6 

11.6 

11.6 

11.6 

11.6 

12.2 

12.2 

12.6 

13.0 

13.6 

13.6 

13.8 

14.0 

14.2 

14.2 

15.0 

15.0 

15.2 

15.7 

15.8 

16.0 

16.1 

16.2 

16.4 

16.5 

16.6 

16.6 

16.6 

16.8 

16.9 

17.2 

17.2 

17.3 

17.3 

17.6 

17.7 

17.7 

17.8 

17.9 

17.9 

18.1 

18.3 

18.6 

19.8 

20.0 

20.0 

20.2 

20.2 

20.4 

20.5 

21.1 

21.8 

22.0 

22.0 

22.1 

22.6 

22.9 

23.2 

23.7 

23.9 

24.0 

24.0 

24.1 

24.8 

24.8 

24.9 

24.9 

25.1 

25.3 

26.3 

26.5 

26.6 

31.8 

32.3 

32.8 

33.8 

36.1 
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(c) Frequency Distribution Tables 

Thirdly, it is possible to save a little space by making a table in which each individual item 
of the ranked data is written down once only, but paired with the number of times it occurs. 
The data is then presented in the form of a “frequency distribution table” . 

TABLE 3 - Frequency Distribution Table 


Value 

Frequency 

Value 

Frequency 

Value 

Frequency 

7.8 

1 

15.8 

1 

21.1 

1 

8.0 

2 

16.0 

1 

21.8 

1 

8.1 

1 

16.1 

1 

22.0 

2 

8.4 

1 

16.2 

1 

22.1 

1 

8.5 

1 

16.4 

1 

22.6 

1 

8.9 

1 

16.5 

1 

22.9 

1 

9.2 

1 

16.6 

3 

23.2 

1 

9.6 

1 

16.8 

1 

23.7 

1 

9.7 

1 

16.9 

1 

23.9 

1 

10.4 

2 

17.2 

2 

24.0 

2 

10.5 

1 

17.3 

2 

24.1 

1 

10.6 

1 

17.6 

1 

24.8 

2 

11.6 

4 

17.7 

2 

24.9 

2 

12.2 

2 

17.8 

1 

25.1 

1 

12.6 

1 

17.9 

2 

25.3 

1 

13.0 

1 

18.1 

1 

26.3 

1 

13.6 

2 

18.3 

1 

26.5 

1 

13.8 

1 

18.6 

1 

26.6 

1 

14.0 

1 

19.8 

1 

31.8 

1 

14.2 

2 

20.0 

2 

32.3 

1 

15.0 

2 

20.2 

2 

32.8 

1 

15.2 

1 

20.4 

1 

33.8 

1 

15.7 

1 

20.5 

1 

36.1 

1 


(d) Grouped Frequency Distribution Tables 

For about forty or more items in a set of numerical data, it usually most convenient to group 
them together into between 10 and 25 “classes” of values, each covering a specified range 
or “class interval” (for example, 7.5 — 10.5, 10.5 — 13.5, 13.5 — 16.5, ). 

Each item is counted every time it appears in order to obtain the “class frequency” and 
each class interval has the same “class width” . 

Too few classes means that the data is over-summarised, while too many classes means that 
there is little advantage in summarising at all. 
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Here, we use the convention that the lower boundary of the class is included while the upper 
boundary is excluded . 

Each item in a particular class is considered to be approximately equal to the “class mid- 
point”; that is, the average of the two “class boundaries”. 

A “grouped frequency distribution table” normally has columns which show the class 
intervals, class mid-points, class frequencies, and “cumulative frequencies”, the last of 
these being a running total of the frequencies themselves. There may also be a column of 
“tallied frequencies” , if the table is being constructed from the raw data without having 
first arranged the values in rank order. 

TABLE 4 - Grouped Frequency Distribution 


Class Interval 

Class Mid-point 

Tallied Frequency 

Frequency 

Cumulative Frequency 

7.5 - 10.5 

9 

////I ////I // 

12 

12 

10.5 - 13.5 

12 

////I ////I 

10 

22 

13.5 - 16.5 

15 

////I ////I ////I 

15 

37 

16.5 - 19.5 

18 

////I ////I ////I //// 

19 

56 

19.5 - 22.5 

21 

////I ////I // 

12 

68 

22.5 - 25.5 

24 

////I ////I //// 

14 

82 

25.5 - 28.5 

27 

III 

3 

85 

28.5-31.5 

30 


0 

85 

31.5-34.5 

33 

nn 

4 

89 

34.5 - 37.5 

36 

/ 

1 

90 


Notes: 

(i) The cumulative frequency shows, at a glance, how many items in the data are less than 
a specified value. In the above table, for example, 82 items are less than 25.5 

(ii) It is sometimes more useful to use the ratio of the cumulative frequency to the total 
number of observations. This ratio is called the “relative cumulative frequency” and, 
in the above table, for example, the percentage of items in the data which are less than 25.5 

is 


82 

90 


x 100 ~ 91% 
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18.1.3 THE GRAPHICAL REPRESENTATION OF DATA 


(a) The Histogram 

A “histogram” is a diagram which is directly related to a grouped frequency distribution 
table and consists of a collection of rectangles whose height represents the class frequency 
(to some suitable scale) and whose breadth represents the class width. 



The histogram shows, at a glance, not just the class intervals with the highest and lowest 
frequencies, but also how the frequencies are distributed. 

In the case of examination results, for example, there is usually a group of high frequencies 
around the central class intervals and lower ones at the ends. Such an ideal situation would 

be called a “Normal Distribution”. 

(b) The Frequency Polygon 

Using the fact that each class interval may be represented, on average, by its class mid-point, 
we may plot the class mid-points against the class frequencies to obtain a display of single 
points. By joining up these points with straight line segments and including two extra class 
mid-points, we obtain a “frequency polygon”. 
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20 


15 
10 
5 

6.0 18 39 



Notes: 

(i) Although the frequency polygon officially plots only the class mid-points against their 
frequencies, it is sometimes convenient to read-off intermediate points in order to estimate 
additional data. For example, we might estimate that the value 11.0 occurred 11 times when, 
in fact, it did not occur at all. 

We may use this technique only for continuous variables. 

(ii) Frequency polygons are more useful than histograms if we wish to compare two or more 
frequency distributions. A clearer picture is obtained if we plot them on the same diagram. 

(iii) If the class intervals are made smaller and smaller while, at the same time, the total 
number of items in the data is increased more and more, the points of the frequency polygon 
will be very close together. The smooth curve joining them is called the “frequency curve” 
and is of greater nse for estimating intermediate values. 

(c) The Cumulative Frequency Polygon (or Ogive) 

The earlier use of the cumulative frequency to estimate the number (or proportion) of values 
less than a certain amount may be applied graphically by plotting the upper class-boundary 
against the cumulative frequency, then joining up the points plotted with straight line seg- 
ments. The graph obtained is called the “cumulative frequency polygon” or “ogive”. 

We may also use a second vertical axis at the right-hand end of the diagram showing the 
relative cumulative frequency. The range of this axis will always be 0 to 1. 
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18.1.4 EXERCISES 

1. State whether the following variables are discrete or continuous: 

(a) The denominators of a set of 10,000 rational numbers; 

(b) The difference between a fixed integer and each one of any set of real numbers; 

(c) The volume of fruit squash in a sample of 200 bottles taken from different firms; 

(d) The minimum plug-gap setting specified by the makers of 30 different cars; 

(e) The points gained by 80 competitors in a skating championship. 

2. The following marks were obtained in an examination taken by 100 students: 


Marks 

25-30 

30-35 

35-40 

40-45 

45-50 

No. of Students 

2 

3 

7 

7 

8 

Marks 

50-55 

55 - 60 

60-65 

65-70 

70-75 

No. of Students 

25 

18 

12 

10 

8 


(a) Draw a histogram for this data; 

(b) Draw a cumulative frequency polygon; 

(c) Estimate the mark exceeded by the top 25% of the students; 

(d) Suggest a pass-mark if 15% of the students are to fail. 
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3. In a test of 35 glue-laminated beams, the following values of the “spring constant” in 
kilopounds per inch were found: 

SPRING CONSTANT x 100 


6.72 

6.77 

6.82 

6.70 

6.78 

6.70 

6.62 

6.75 

6.66 

6.66 

6.64 

6.76 

6.73 

6.80 

6.72 

6.76 

6.76 

6.68 

6.66 

6.62 

6.72 

6.76 

6.70 

6.78 

6.76 

6.67 

6.70 

6.72 

6.74 

6.81 

6.79 

6.78 

6.66 

6.76 

6.72 


(a) Arrange this data in rank order and hence construct a frequency distribution table; 

(b) Construct a grouped frequency distribution table with class intervals of length 0.02 
starting with 6.61 — 6.63 and include columns to show the class mid-points and the 
cumulative frequency. 

4. The following data shows the diameters of 25 ball-bearings in cms. 


0.386 

0.391 

0.396 

0.380 

0.397 

0.376 

0.384 

0.401 

0.382 

0.404 

0.390 

0.404 

0.380 

0.390 

0.388 

0.381 

0.393 

0.387 

0.377 

0.399 

0.400 

0.383 

0.384 

0.390 

0.379 


Construct a grouped frequency distribution table with class intervals 0.375 — 0.380, 

0.380 — 0.385 etc. and construct 

(a) the histogram; 

(b) the frequency polygon; 

(c) the cumulative frequency polygon (ogive). 


18.1.5 SELECTED ANSWERS TO EXERCISES 

1. (a) discrete 

(b) continuous 

(c) continuous 

(d) discrete 

(e) discrete 

2. (c) 63% is the mark exceeded by the top 25% of students 

(d) 43% needs to be the pass-mark if 15% of the students are to fail. 
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UNIT 18.2 - STATISTICS 2 


MEASURES OF CENTRAL TENDENCY 

18.2.1 INTRODUCTION 

Having shown, in Unit 18.1, how statistical data may be presented in a clear and concise 
form, we shall now be concerned with the methods of analysing the data in order to obtain 
the maximum amount of information from it. 

In the previous Unit, it was stated that statistical problems may be either “descriptive 
problems” (in which all the data is known and can be analysed) or “inference problems” 
(in which data collected from a “sample” population is used to infer properties of a larger 
population). 

In both types of problem, it is useful to be able to measure some value around which all 
items in the data may be considered to cluster. This is called “a measure of central 
tendency”; and we find it by using several types of average value as follows: 

18.2.2 THE ARITHMETIC MEAN (BY CODING) 

To obtain the “arithmetic mean” of a finite collection of n numbers, we may simply add 
all the numbers together and then divide by n. This elementary rule applies even if some of 
the numbers occur more than once and even if some of the numbers are negative. 

However, the purpose of this section is to introduce some short-cuts (called “coding”) in 
the calculation of the arithmetic mean of large collections of data. The methods will be 
illustrated by the following example, in which the number of items of data is not over-large: 

EXAMPLE 

The solid contents, x , of water (in parts per million) was measured in eleven samples and 
the following data was obtained: 

4520 4490 4500 4500 
4570 4540 4520 4590 
4520 4570 4520 

Determine the arithmetic mean, x, of the data. 
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Solution 


(i) Direct Calculation 

By adding together the eleven numbers, then dividing by 11, we obtain 

x = 49840 -r- 11 ~ 4530.91 


(ii) Using Frequencies 

We could first make a frequency table having a column of distinct values x i: (i = 1, 2, 3, , 11), 

a column of frequencies /), (i = 1,2,3 , 11) and a column of corresponding values 

The arithmetic mean is then calculated from the formula 


1 11 

x = 77 E / i X i' 
i = 1 


In the present example, the table would be 


Xi 

fi 

fiXi 

4490 

1 

4490 

4500 

2 

9000 

4520 

4 

18080 

4540 

1 

4540 

4570 

2 

9140 

4590 

1 

4590 


Total 

49840 


The arithmetic mean is then x = 49840 -j- 11 ~ 4530.91 as before. 

(iii) Reduction by a constant 

With such large data- values, as in the present example, it can be convenient to reduce all of 
the values by a constant, k, before calculating the arithmetic mean. 

It is easy to show that, by adding the constant, k, to the arithmetic mean of the reduced 
data, we obtain the arithmetic mean of the original data. 

Proof: 

For n values, xi,x 2 ,x 3 , ,x n , the arithmetic mean is given by 
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_ Xi + x 2 + x 3 + 


X r , 


X = 


n 


If each value is reduced by a constant, k, the arithmetic mean of the reduced data is 


(x 1 - k) + (x 2 - k) + (x 3 - k) + + (x n — k) x 1 + x 2 + x 3 + + x n nk _ 

= = x — k. 

n n n 


(iv) Division by a constant 

In a similar way to the previous paragraph, each value in a collection of data could by divided 
by a constant, k, before calculating the arithmetic mean. 

This time, we may show that the arithmetic mean of the original data is obtained on multi- 
plying the arithmetic mean of the reduced data by k. 

Proof: 


a J. a 4. X I _ 

k T. fc _ k T. _ k _ ± 

n k 


In order to summarise the shortcuts used in the present example, the following table shows 
a combination of the use of frequencies and of the two types of reduction made to the data: 


Xi 

Xi — 4490 

x\ = (xi - 4490) £ 10 

ft 

fiA 

4490 

0 

0 

1 

0 

4500 

10 

1 

2 

2 

4520 

30 

3 

4 

12 

4540 

50 

5 

1 

5 

4570 

80 

8 

2 

16 

4590 

100 

10 

1 

10 




Total 

45 


The fictitious arithmetic mean, x' 


45 

IT 


~ 4.0909 


The actual arithmetic mean, x ~ (4.0909 x 10) + 4490 ~ 4530.91 
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(v) The approximate arithmetic mean for a grouped distribution 

For a large number of items of data, we may (without losing too much accuracy) take all 
items within a class interval to be equal to the class mid-point. 

A calculation similar to that in the previous paragraph may then be performed if we reduce 
each mid-point by the first mid-point and divide by the class width (or other convenient 
number) . 

EXAMPLE 

Calculate, approximately, the arithmetic mean of the data in TABLE 4 on page 4 of 
Unit 18.1 


Solution 


Class Interval 

Class Mid-point 

Xi 

Xi - 9 

(xi - 9) 4 - 3 
= < 

Frequency 

fi 

fiX'i 

7.5 - 10.5 

9 

0 

0 

12 

0 

10.5 - 13.5 

12 

3 

1 

10 

10 

13.5 - 16.5 

15 

6 

2 

15 

30 

16.5 - 19.5 

18 

9 

3 

19 

57 

19.5 - 22.5 

21 

12 

4 

12 

48 

22.5 - 25.5 

24 

15 

5 

14 

70 

25.5 - 28.5 

27 

18 

6 

3 

18 

28.5-31.5 

30 

21 

7 

0 

0 

31.5-34.5 

33 

24 

8 

4 

32 

34.5 - 37.5 

36 

27 

9 

1 

9 




Totals 

90 

274 


Fictitious arithmetic mean x' 


~ 3.0444 

90 


Actual arithmetic mean = 3.044 x 3 + 9 ~ 18.13. 


Notes: 

(i) By direct calculation from TABLE 1 in Unit 18.1, it may be shown that the arithmetic 
mean is 17.86 correct to two places of decimals; and this indicates an error of about 1.5%. 

(ii) The arithmetic mean is widely used where samples are taken of a larger population. It 
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usually turns out that two samples of the same population have arithmetic means which are 
close in value. 

18.2.3 THE MEDIAN 

Collections of data often include one or more values which are widely out of character with 
the rest; and the arithmetic mean can be significantly affected by such extreme values. 

For example, the values 8,12,13,15,21,23 have an arithmetic mean of ~ 15.33; but the 
values 5,12,13,15,21,36 have an arithmetic mean of ~ 17.00. 

A second type of average, not so much affected, is defined as follows: 

DEFINITION 

The “median” of a collection of data is the middle value when the data is arranged in rank 
order. For an even number of values in the collection of data, the median is the arithmetic 
mean of the centre two values. 

EXAMPLES 


1. For both 8,12,13,15,21,23 and 5,12,13,15,21,36, the median is given by 


13 + 15 
2 


14. 


2. For a grouped distribution, the problem is more complex since we no longer have access 
to the individual values from the data. 

However, the area of a histogram is directly proportional to the total number of values 
which it represents, since the base of all the rectangles are the same width and each 
height represents a frequency. 

We may thus take the median to be the value for which the vertical line through it 
divides the histogram into two equal areas. 

For non-symmetrical histograms, the median is often a better measure of central ten- 
dency than the arithmetic mean. 

Illustration 

Consider the histogram from Unit 18.1, representing rainfall figures over a 90 year 
period. 
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The total area of the histogram = 90 x 3 = 270. 

Half the area of the histogram = 135. 

The area up as far as 16.5 = 3 x 37 = 111 while the area up as far as 19.5 = 3 x 56 = 168; 
hence the Median must lie between 16.5 and 19.5 

The Median = 16.5 + x where 18a; = 135 — 111 = 24 since 18 is the frequency of the class 
interval 16.5 — 19.5 

That is, 


24 




1.33, 


giving a Median of 17.83 

Notes: 

(i) The median, in this case, is close to the arithmetic mean since the distribution is fairly 
symmetrical. 

(ii) If a sequence of zero frequencies occurs, it may be necessary to take the arithmetic mean 
of two class mid-points, which are not consecutive to each other. 

(iii) Another example of the advantage of median over arithmetic mean would be the average 
life of 100 electric lamps. To find the arithmetic mean, all 100 must be tested; but to find 
the median, the testing may stop after the 51st. 
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18.2.4 THE MODE 


DEFINITIONS 

1. For a collection of individual items of data, the “mode” is the value having the highest 
frequency. 

2. In a grouped frequency distribution, the mid-point of the class interval with the highest 
frequency is called the “crude mode” and the class interval itself is called the “modal 
class” . 

Note: 

Like the median, the mode is not much affected by changes in the extreme values of the data. 
However, some distributions may have several different modes, which is a disadvantage of 
this measure of central tendency. 

EXAMPLE 

For the histogram discussed earlier, the mode is 18.0; but if the class interval, 22.5 — 25.5, 
had 5 more members, then 24.0 would be a mode as well. 

18.2.5 QUANTILES 

To conclude this Unit, we shall define three more standard measurements which, in fact, 
extend the idea of a median; and we may recall that a median divides a collection of values 
in such a way that half of them fall on either side of it. 

Collectively, these three new measurements are called “quantiles” but may be considered 
separately by their own names as follows: 

(a) Quartiles 

These are the three numbers dividing a ranked collection of values (or the area of a histogram) 
into 4 equal parts. 

(b) Deciles 

These are the nine numbers dividing a ranked collection of values (or the area of a histogram) 
into 10 equal parts. 

(c) Percentiles 

These are the ninety nine numbers dividing a ranked collection of values (or the area of a 
histogram) into 100 equal parts. 
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Note: 

For collections of individual values, quartiles may need to be calculated as the arithmetic 
mean of two consecutive values. 

EXAMPLES 

1. (a) The 25th percentile = The 1st quartile. 

(b) The 5th Decile = The median. 

(c) The 85th Percentile = the point at which 85% of the values fall below it and 15% 
above it. 

2. For the collection of values 

5,12,13,19,25,26,30,33, 

the quartiles are 12.5, 22 and 28. 

3. For the collection of values 

5,12,13,19,25,26,30, 
the quartiles are 12.5, 19 and 25.5 

18.2.6 EXERCISES 

1. The arithmetic mean of 75 observations is 52.6 and the arithmetic mean of 25 similar 
observations is 48.4; determine the Arithmetic Mean of all 100 observations. 

2. Of 500 students, whose mean height is 67.8 inches, 150 are women. If the mean height 
of 150 women is 62.0 inches, what is the mean height of the men ? 

3. By coding the following collection of data, determine the arithmetic mean correct to 
three places of decimals: 

1.847, 1.843, 1.842, 1.847, 1.848, 1.841, 1.845 

4. Using a histogram of the frequency distribution shown, determine 

(a) the arithmetic mean; 

(b) the median class; 

(c) the median; 



(d) the modal class; 

(e) the crude mode. 


class interval 

15-25 

25-35 

35-45 

45-55 

55-65 

65-75 

Frequency 

4 

11 

19 

14 

0 

2 


5. The number of a certain component issued, per day, from stock over a 40 day period is 
given as follows: 


83 

80 

91 

81 

88 

82 

87 

97 

83 

99 

75 

85 

72 

92 

84 

90 

87 

78 

93 

98 

86 

80 

93 

86 

88 

83 

82 

101 

89 

82 

85 

95 

80 

89 

84 

92 

76 

81 

103 

94 


Using class intervals 70 — 75, 75 — 80, 80 — 85 etc., draw up a frequency distribution 
table and construct a histogram. 

From the histogram, determine the median and the 7th Decile. 


18.2.7 ANSWERS TO EXERCISES 


1 . 

2 . 

3. 


4. (a) 

(b) 

(c) 

(d) 


5. 


51.55 


70.3 inches. 


1.845 

40.20 


35 - 45. 


40.26 


35 - 45. 


40. 

Median = 86.5, 7th Decile = 91.25 
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UNIT 18.3 - STATISTICS 3 


MEASURES OF DISPERSION (OR SCATTER) 

18.3.1 INTRODUCTION 

Averages typify a whole collection of values, but they give little information about how the 
values are distributed within the whole collection. 

For example, 99.9, 100.0, 100.1 is a collection which has an arithmetic mean of 100.0 and so 
is 99.0,100.0,101.0; but the second collection is more widely dispersed than the first. 

It is the purpose of this Unit to examine two types of quantity which typify the distance of 
all the values in a collection from their arithmetic mean. They are known as measures of 
dispersion (or scatter) and the smaller these quantities are, the more clustered are the values 
around the arithmetic mean. 

18.3.2 THE MEAN DEVIATION 

If the n values aq, aq , aq, ,x n have an arithmetic mean of x, then x\ — x, X 2 — x, 

aq — x, ,x n — x are called the “deviations” of aq, aq, aq, , x n from the arithmetic 

mean. 

Note: 

The deviations add up to zero since 


E 


n n 

(aq — x) — aq — ^ x — nx — nx = 0 
1=1 1=1 


DEFINITION 

The “mean deviation” (or, more accurately, the “mean absolute deviation ) is defined by 
the formula 


1 n 

M.D. = — Xj — x 
n 
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18.3.3 PRACTICAL CALCULATION OF MEAN DEVIATION 


In calculating a mean deviation, the following short-cuts usually turn out to be useful, 
especially for larger collections of values: 

(a) If a constant, k, is subtracted from each of the values ay (i = l,2,3...n), and also we use 
the “fictitious” arithmetic mean, x — k, in the formula, then the mean deviation is unaffected. 

Proof: 


in I n 

\ x i~M = K®* -k)-(x- k)\. 

n i = 1 n i= 1 

(b) If we divide each of the values ay (i = l,2,3,...n) by a positive constant, /, and also we 
use the “fictitious” arithmetic mean |, then the mean deviation will be divided by /. 

Proof: 


1 n 

— ]T |ay -x 

InU. 



ay 

l 


x 

1 ' 


Summary 

If we code the data using both a subtraction by k and a division by /, the value obtained 
from the mean deviation formula needs to multiplied by l to give the correct value. 

18.3.4 THE ROOT MEAN SQUARE (OR STANDARD) DEVIATION 

A more common method of measuring dispersion, which ensures that negative deviations 
from the arithmetic mean do not tend to cancel out positive deviations, is to determine the 
arithmetic mean of their squares, and then take the square root. 

DEFINITION 

The “root mean square deviation” (or “standard deviaton ”) is defined by the formula 


R.M.S.D. 


1 n 

\|«§ 
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Notes: 


(i) The root mean square deviation is usually denoted by the symbol, a. 

(ii) The quantity cr 2 is called the “variance” . 

18.3.5 PRACTICAL CALCULATION OF THE STANDARD DEVIATION 

In calculating a standard deviation, the following short-cuts usually turn out to be useful, 
especially for larger collections of values: 

(a) If a constant, k. is subtracted from each of the values ay (i = l,2,3...n), and also we use 
the “fictitious” arithmetic mean, x — k, in the formula, then a is unaffected. 

Proof: 


\ 


- E ( x i - x ) 2 = 


n u i 


\ 


“E [& - k) - (x - k)Y 


n i = i 


(b) If we divide each of the values ay (i = 1,2,3, ...n) by a constant, /, and also we use the 
“fictitious” arithmetic mean |, then a will be divided by l. 

Proof: 


'N 



(Xj — x ) 2 = 


\ 


It, 


n 


i= 1 




2 


Summary 

If we code the data using both a subtraction by k and a division by /, the value obtained 
from the standard deviation formula needs to multiplied by l to give the correct value, a. 

(c) For the calculation of the standard deviation, whether by coding or not, a more convenient 
formula may be obtained by expanding out the expression (ay — x) 2 as follows: 

n n n 

E - 2 T E ay + E X 2 . 

. 2—1 i = 1 2=1 
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That is, 


which gives the formula 


o" J 



2 . , — 2 i — 2 

x i — 2x + x , 


a = 





x 2 . 


Note: 

In advanced statistical work, the above formulae for standard deviation are used only for 
descriptive problems in which we know every member of a collection of observations. 

For inference problems, it may be shown that the standard deviation of a sample is always 
smaller than that of a total population; and the basic formula used for a sample is 


a = 


\ 




(Xi - x) 2 


18.3.6 OTHER MEASURES OF DISPERSION 

We mention here, briefly, two other measures of dispersion: 

(i) The Range 

This is the difference between the highest and the smallest members of a collection of values. 

(ii) The Coefficient of Variation 

This is a quantity which expresses the standard deviation as a percentage of the arithmetic 
mean. It is given by the formula 

C.V. = = x 100. 

x 
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EXAMPLE 


The following grouped frequency distribution table shows the diameter of 98 rivets: 


Class 

Intvl. 

Cls. Mid 
Pt. Xi 

Freq. 

fi 

Cum. 

Freq. 

(xi - 6.61) 4- 0.02 

= x/ 

fiXi 

72 

Xi 

-P 

JiXi 

fi\Xi - x'\ 

6.60 — 6.62 

6.61 

1 

1 

0 

0 

0 

0 

0.58 

6.62 — 6.64 

6.63 

4 

5 

1 

4 

1 

4 

2.40 

6.64 — 6.66 

6.65 

6 

11 

2 

12 

4 

24 

3.72 

6.66 — 6.68 

6.67 

12 

23 

3 

36 

9 

108 

7.68 

6.68 - 6.70 

6.69 

5 

28 

4 

20 

16 

80 

3.30 

6.70 - 6.72 

6.71 

10 

38 

5 

50 

25 

250 

6.80 

6.72 - 6.74 

6.73 

17 

55 

6 

102 

36 

612 

11.90 

6.74 - 6.76 

6.75 

10 

65 

7 

70 

49 

490 

7.20 

6.76 - 6.78 

6.77 

14 

79 

8 

112 

64 

896 

10.36 

6.78 - 6.80 

6.79 

9 

88 

9 

81 

81 

729 

6.84 

6.80-6.82 

6.81 

7 

95 

10 

70 

100 

700 

5.46 

6.82 - 6.84 

6.83 

2 

97 

11 

22 

121 

242 

1.60 

6.84-6.86 

6.85 

1 

98 

12 

12 

144 

144 

0.82 

Totals 


98 



591 


4279 

68.66 


Estimate the arithmetic mean, the standard deviation and the mean (absolute) deviation of 
these diameters. 


Solution 


Fictitious arithmetic mean 


591 

~98~ 


~ 6.03 


Actual arithmetic mean = 6.03 x 0.02 + 6.61 ~ 6.73 


/4279 

Fictitious standard deviation = \ — — 6.03 2 ~ 2.70 


Actual standard deviation = 2.70 x 0.02 ~ 0.054 


, , 68.66 

fictitious mean deviation = ~ 0.70 
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Actual mean deviation ~ 0.70 x 0.02 ~ 0.014 


18.3.7 EXERCISES 

1. For the collection of numbers 

6.5, 8.3, 4.7, 9.2, 11.3, 8.5, 9.5, 9.2 

calculate (correct to two places of decimals) the arithmetic mean, the standard deviation 
and the mean (absolute) deviation. 

2. Estimate the arithmetic mean, the standard deviation and the mean (absolute) devia- 
tion for the following grouped frequency distribution table: 


Class Interval 

10-30 

30-50 

50-70 

70-90 

90 - 110 

110 - 130 

Frequency 

5 

8 

12 

18 

3 

2 


3. The following table shows the registered speeds of 100 speedometers at 30m. p.h. 


Regd. Speed 

Frequency 

27.5 - 28.5 

2 

28.5 - 29.5 

9 

29.5 - 30.5 

17 

30.5-31.5 

26 

31.5-32.5 

24 

32.5 - 33.5 

16 

33.5 - 34.5 

5 

34.5 - 35.5 

1 


Estimate the arithmetic mean, the standard deviation and the coefficient of variation. 

18.3.8 ANSWERS TO EXERCISES 

1. Arithmetic mean ~ 8.40, standard deviation ~ 1.88, mean deviation ~ 1.43 

2. Arithmetic mean ~ 65, standard deviation ~ 24.66, mean deviation ~ 20.21 

3. Arithmetic mean ~ 31.4, standard deviation ~ 1.44, coefficient of variation ~ 4.61. 
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UNIT 18.4 - STATISTICS 4 


THE PRINCIPLE OF LEAST SQUARES 
18.4.1 THE NORMAL EQUATIONS 

Suppose two variables, x and y, are known to obey a “straight line law” of the form 
y = a + bx, where a and b are constants to be found. 

Suppose also that, in an experiment to test this law, we obtain n pairs of values, ( x^y ^ ), 
where % = 1,2,3, ...,n. 

If the values Xi are assigned values, they are likely to be free from error, whereas the 
observed values, y t will be subject to experimental error. 

The principle underlying the straight line of “best fit” is that, in its most likely position, 
the sum of the squares of the ^/-deviations, from the line, of all observed points is a minimum. 

Using partial differentiation, it may be shown that 


E 


n 

y% = na + b 

i= 1 


Xi 


( 1 ) 


and 


E x ah = «E x i + & E x i ( 2 )- 

i = 1 i= 1 i= 1 

The statements (1) and (2) are two simultaneous equations which may be solved for a and 
b. 

They are called the “normal equations” . 

A simpler notation for the normal equations is 


and 


E y = na + 6E x 


E xy = aEx + 6E x 2 . 
By eliminating a and b in turn, we obtain the solutions 
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a = 


and b 


E a; 2 .E y — Ea;.E xy 
nE x 2 - (E a;) 2 


nE xy — E x.E y 
nE a; 2 - (E a;) 2 ' 


With these values of a and 6, the straight line y = a + bx is called the “regression line of 
y on a;”. 

EXAMPLE 

Determine the equation of the regression line of y on x for the following data which shows 
the Packed Cell Volume, mini, and the Red Blood Cell Count, y millions, of 10 dogs: 


X 

45 

42 

56 

48 

42 

35 

58 

40 

39 

50 

y 

6.53 

6.30 

9.52 

7.50 

6.99 

5.90 

9.49 

6.20 

6.55 

8.72 


Solution 


X 

y 

xy 

x ' 2 

45 

6.53 

293.85 

2025 

42 

6.30 

264.60 

1764 

56 

9.52 

533.12 

3136 

48 

7.50 

360.00 

2304 

42 

6.99 

293.58 

1764 

35 

5.90 

206.50 

1225 

58 

9.49 

550.42 

3364 

40 

6.20 

248.00 

1600 

39 

6.55 

255.45 

1521 

50 

8.72 

436.00 

2500 

455 

73.70 

3441.52 

21203 


The regression line of y on x thus has equation y = a + bx, where 


(21203) (73.70) - (455) (3441.52) 
(10)(21203) - (455) 2 


~ -0.645 


and 


_ (10) (3441.52) - (455) (73.70) 
(10)21203) - (455) 2 


Thus, y = 0.176a; — 0.645 
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18.4.2. SIMPLIFIED CALCULATION OF REGRESSION LINES 


A simpler method of determining the regression line of y on x for a given set of data, is to 
consider a temporary change of origin to the point ( x,y ), where x is the arithmetic mean of 
the values Xi and y is the arithmetic mean of the values y l . 

RESULT 

The regression line of y on x contains the point (x,y). 

Proof: 

From the first of the normal equations, 


Sy 

n 


a + b 


E x 


n 


That is, 


y — a + bx. 


A change of origin to the point ( x,y ), with new variables X and Y, is associated with the 
formulae 


X = x — x and Y = y — y 

and, in this system of reference, the regression line will pass through the origin. 
Its equation is therefore 


Y = BX, 


where 


nE XY - E XE Y 
nE A 2 - (E A ) 2 
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However, 


E X — E (x — x) = E x — E x = nx — nx = 0 


and 


Ey = E (y — y) — Y, y — Y, y — ny — ny — 0. 


Thus, 


B 


s xy 

E X 2 ' 


Note: 

In a given problem, we make a table of values of Xi, yi, X l: Y t} XiYi and Xf. 
The regression line is then 


y — y — B(x — x) or y — BX + (y — Bx)] 

though, there may be slight differences in the result obtained compared with that from the 
earlier method. 

EXAMPLE 

Determine the equation of the regression line of y on x for the following data which shows 
the Packed Cell Volume, xmm, and the Red Blood Cell Count, y millions, of 10 dogs: 


X 

45 

42 

56 

48 

42 

35 

58 

40 

39 

50 

y 

6.53 

6.30 

9.52 

7.50 

6.99 

5.90 

9.49 

6.20 

6.55 

8.72 


Solution 

The arithmetic mean of the x values is x = 45.5 
The arithmetic mean of the y values is y — 7.37 
This gives the following table: 
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X 

y 

X = x — x 

Y — y — y 

XY 

X' 2 

45 

6.53 

-0.5 

-0.84 

0.42 

0.25 

42 

6.30 

-3.5 

-1.07 

3.745 

12.25 

56 

9.52 

10.5 

2.15 

22.575 

110.25 

48 

7.50 

2.5 

0.13 

0.325 

6.25 

42 

6.99 

-3.5 

-0.38 

1.33 

12.25 

35 

5.90 

-10.5 

-1.47 

15.435 

110.25 

58 

9.49 

12.5 

2.12 

26.5 

156.25 

40 

6.20 

-5.5 

-1.17 

6.435 

30.25 

39 

6.55 

-6.5 

-0.82 

5.33 

42.25 

50 

8.72 

4.5 

1.35 

6.075 

20.25 

455 

73.70 



88.17 

500.5 


Hence, 


B 


88.17 

500.5 


~ 0.176 


and so the regression line has equation 

y = 0.176a; + (7.37 - 0.176 x 45.5). 


That is, 


y = 0.176a; -0.638 


18.4.3 EXERCISES 

1. For the following tables, determine the regression line of y on x, assuming that y = a+bx. 
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X 

l 

3 

5 

10 

12 

y 

58 

55 

40 

37 

22 


2. To determine the relation between the normal stress and the shear resistance of soil, a 
shear-box experiment was performed, giving the following results: 


Normal Stress, x p.s.i. 

11 

13 

15 

17 

19 

21 

Shear Stress, y p.s.i. 

15.2 

17.7 

19.3 

21.5 

23.9 

25.4 


If y — a + bx, determine the regression line of y on x. 

3. Fuel consumption, y miles per gallon, at speeds of x miles per hour, is given by the 
following table: 


X 

20 

30 

40 

50 

60 

70 

80 

90 

y 

18.3 

18.8 

19.1 

19.3 

19.5 

19.7 

19.8 

20.0 


Assuming that 

b 

V = a+ -, 
x 

determine the most probable values of a and b. 


18.4.4 ANSWERS TO EXERCISES 


1. (a) 

(b) 


y = 6.46 — 3.52a:; 


y = 54.20 + 0.52a;; 


y = 60.78 - 2.97a;. 


2 . 


y = 4.09 + 1.03a;. 


3. 


a ~ —42 and b ~ 20. 
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UNIT 19.1 - PROBABILITY 1 - DEFINITIONS AND RULES 


19.1.1 INTRODUCTION 

To introduce the definition of probability, suppose 30 high-strength bolts became mixed with 
25 ordinary bolts by mistake, all of the bolts being identical in appearance. 

We would like to know how sure we can be that, in choosing a bolt, it will be a high-strength 
one. Phrases like “quite sure” or “fairly sure” are useless, mathematically, and we define a 
way of measuring the certainty. 

We know that, in 55 simultaneous choices, 30 will be of high strength and 25 will be ordinary; 
so we say that, in one choice, there is a || chance of success; that is, approximately, a 0.55 
chance of success. 

Obviously, in one single choice, we haven’t any idea what the result will be; but experience 
has proved that, in a significant number of choices, just over half will most likely give a 
high-strength bolt. 

Such predictions can be used, for example, to estimate the cost of mistakes on a production 
line. 

DEFINITION 1. 

The various occurrences which are possible in a statistical problem are called “events” . 

If we are interested in one particular event, it is termed “successful” when it occurs and 
“unsuccessful” when it does not. 

ILLUSTRATION 

If, in a collection of 100 bolts, there are 30 high-strength, 25 ordinary and 45 low-strength, 
then we have three possible events according to which type is chosen. 

We can make 100 “trials” and, in each trial, one of three events will occur. 

DEFINITION 2. 

If, in n possible trials, a successful event occurs s times, then the number 1 is called the 

“probability of success in a single trial” . It is also known as the “relative frequency 
of success”. 

ILLUSTRATIONS 


1. From a bag containing 7 black balls and 4 white balls, the probability of drawing a 
white ball is , 4 . . 
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2. In tossing a perfectly balanced coin, the probability of obtaining a head is 

3. In throwing a die, the probability of getting a six is jL 

4. If 50 chocolates are identical in appearance, but consist of 15 soft-centres and 35 hard- 
centres, the probability of choosing a soft-centre is = 0.3. 

19.1.2 APPLICATION OF PROBABILITY TO GAMES OF CHANCE 

If a competitor in a game of chance has a probability, p, of winning, and the prize money is 
£m, then £mp is considered to be a fair price for entry to the game. 

The quantity nip is known as the “expectation” of the competitor. 

19.1.3 EMPIRICAL PROBABILITY 

So far, all the problems discussed on probability have been “descriptive”; that is, we know all 
the possible events, the number of successes and the number of failures. In other problems, 
called “inference” problems, it is necessary either (a) to take “samples” in order to infer 
facts about a total “population’ (for example, a public census or an investigation of moon- 
rock); or (b) to rely on past experience (for example past records of heart deaths, road 
accidents, component failure). 

If the probability of success, used in a problem, has been inferred by samples or previous 
experience, it is called “empirical probability”. 

However, once the probability has been calculated, the calculations are carried out in the 
same way as for descriptive problems. 

19.1.4 TYPES OF EVENT 
DEFINITION 3. 

If two or more events are such that not more than one of them can occur in a single trial, 
they are called “mutually exclusive” . 

ILLUSTRATION 

Drawing an Ace or drawing a King from a pack of cards are mutually exclusive events; but 
drawing an Ace and drawing a Spade are not mutually exclusive events. 

DEFINITION 4. 

If two or more events are such that the probability of any one of them occurring is not 
affected by the occurrence of another, they are called “independent” events. 
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ILLUSTRATION 


From a pack of 52 cards (that is, Jokers removed), the event of drawing and immediately 
replacing a red card will have a probability of || = 0.5; and the probability of this occurring 
a second time will be exactly the same. They are independent events. 

However, two successive events of drawing a red card without replacing it are not indepen- 
dent. If the first card drawn is red, the probability that the second is red will be but, if 
the first card drawn is black, the probability that the second is red will be || 

19.1.5 RULES OF PROBABILITY 

1. If Pi,P 2 ,P 3 , , p r are the separate probabilities of r mutually exclusive events, then 

the probability that some one of the r events will occur is 

P 1 +P 2 +P 3 + + Pr- 


ILLUSTRATION 

Suppose a bag contains 100 balls of which 1 is red, 2 are blue and 3 are black. The 
probability of choosing any one of these three colours will be 

0.06 = 0.01 + 0.02 + 0.03 

However, the probability of drawing a spade or an ace from a pack of 52 cards will not 
be || + T — 11 but || since there are just 16 cards which are either a spade or an ace. 

2. If pi,/> 2 , 7 +, ,p r are the separate probabilities of r independent events, then the 

probability that all will occur in a single trial is 

7+ +2-713 Pr- 


ILLUSTRATION 

Suppose there are three bags, each containing white, red and blue balls. Suppose also 
that the probabilities of drawing a white ball from the first bag, a red ball from the 
second bag and a blue ball from the third bag are respectively pi,p 2 and p 3 . The 
probability of making these three choices in succession is pi+ 2+3 because they are 
independent events. 

However, if three cards are drawn, without replacing, from a pack of 52 cards, the 

probability of drawing a 3, followed by an ace, followed by a red card will not be 
jL A 26 
52 ‘ 52 ’ 52 ' 
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19.1.6 CONDITIONAL PROBABILITIES 


For dependent events, the multiplication rule requires a knowledge of the new probabilities 
of successive events in the trial, after the previous ones have been dealt with. These are 

called “conditional probabilitities” . 

EXAMPLE 

From a box, containing 6 white balls and 4 black balls, 3 balls are drawn at random without 
replacing them. What is the probability that there will be 2 white and 1 black ? 

Solution 

The cases to consider, together with their probabilities are as follows: 

(a) White, White, Black Probability = jj)X§x§ = ff§ = |- 

(b) Black, White, White Probability = x[jX|x| = ||^ = |. 

(c) White, Black, White Probability = ^ x |x| = pfj = ^. 

The probability of any one of these three outcomes is therefore 

1 1 1 _ 1 
6 + 6 + 6 ~~ 2 ' 


19.1.7 EXERCISES 

1. A card is drawn at random from a deck of 52 playing cards. What is the probability 
that it is either an Ace or a picture card ? 

2. If a die is rolled, what is the probability that the roll yields either a 3 or a 4 ? 

3. In a single throw of two dice, what is the probability that a 9 or a doublet will be 
thrown ? 

4. Ten balls, numbered 1 to 10, are placed in a bag. One ball is drawn and not replaced; 
and then a second ball is drawn. What are the probabilities that 

(a) the balls numbered 3 and 7 are drawn; 

(b) neither of these two balls are drawn ? 

5. On a gaming machine, there are three reels with ten digits 0,1, 2, 3, 4, 5, 6, 7, 8, 9 plus a star 
on each reel. When a coin is inserted, and the machine started, the three reels revolve 
independently before coming to rest. 
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(a) What is the probabiity of getting a particular sequence of numbers ? 

(b) What is the probability of getting three stars ? 

(c) What is the probability of getting the same number on each reel ? 

6. Three balls in succession are drawn, without replacement, from a bag containing 8 
black, 8 white and 8 red balls. If a prize of £5 is awarded for drawing no black balls, 
what is the expectation ? 

7. Three persons A,B and C take turns to throw three dice once. If the first one to throw 
a total of 11 is awarded a prize of £200, what are the expectations of A,B and C ? 

19.1.8 ANSWERS TO EXERCISES 



4. (a) 


1 

45’ 

(b) 


5. (a) 

(b) 


45' 


1 

IP 


~ 0.00075; 


1 

IP 


~ 0.00075; 


10 

IP 


~ 0.0075 


6. The expectation is £1.38 

7. For A,B and C, the expectations are £25, £21.88 and £19.14 respectively since the 
probabilities are |, p- and p respectively. 
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UNIT 19.2 - PROBABILITY 2 - PERMUTATIONS AND COMBINATIONS 


19.2.1 INTRODUCTION 

In Unit 19.1, we saw that, in the type of problem known as “descriptive”, we can work out 
the probability that an event will occur by counting up the total number of possible trials 
and the number of successful ones amongst them. But this can often be a tedious process 
without the results of the work which is included in the present Unit. 

DEFINITION 1. 

Each different arrangement of all or part of a set of objects is called a “permutation” . 

DEFINITION 2. 

Any set which can be made by using all or part of a given collection of objects, without 
regard to order, is called a “combination” . 

EXAMPLES 

1. Nine balls, numbered 1 to 9, are put into a bag, then emptied into a channel which 
guides them into a line of pockets. What is the probability of obtaining a particular 
nine digit number ? 

Solution 

We require the total number of arrangements of the nine digits and this is evaluated as 
follows: 

There are nine ways in which a digit can appear in the first pocket and, for each of these 
ways, there are then eight choices for the second pocket. Hence, the first two pockets 
can be filled in 9 x 8 = 72 ways. 

Continuing in this manner, the total number of arrangements will be 
9x8x7x6x x3x2x xl = 362880 = T, (say). 

This is the number of permutations of the ten digits and the required probability is 
therefore equal to 

2. A box contains five components of identical appearance but different qualities. What 
is the probability of choosing a pair of components from the highest two qualities ? 

Solution 
Method 1. 

Let the components be A, B, C, D, E in order of descending quality. 


1 



The choices are 
AB AC AD AE 
BC BD BE 
CD CE 
DE, 

giving ten choices. 

These are the various combinations of five objects, two at a time; hence, the probability 
for AB = A 

Method 2. 

We could also use the ideas of conditional probability as follows: 

The probability of drawing A = |. 

The probability of drawing B without replacing A = |. 

The probability of drawing A and B in either order = 


19.2.2 RULES OF PERMUTATIONS AND COMBINATIONS 


1. The number of permutations of all n objects in a set of n is 

n(n — l)(n — 2) 3.2.1 

which is denoted for short by the symbol n! It is called “n factorial” . 
This rule was demonstrated in Example 1 of the previous section. 

2. The number of permutations of n objects r at a time is given by 

Til 

n(n — l)(n — 2) (n — r + 1) = 

[n — ry 


Explanation 

The first object can be chosen in any one of n different ways. 

For each of these, the second object can then be chosen in n — 1 ways. 
For each of these, the third object can then be chosen in n — 2 ways. 


For each of these, the r-th object can be chosen in n — (r — 1) = n — r + 1 ways. 
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Note: 

In Example 2 of the previous section, the number of permutations of five components 
two at a time is given by 

5! 5! 5.4.3.2.1 

rr = — = = 20. 

(5-2)! 3! 3.2.1 

This is double the number of choices we obtained for any two components out of five 
because, in a permutation, the order matters. 

3. The number of combinations of n objects r at a time is given by 

n\ 

( n — r)!r! 


Explanation 

This is very much the same problem as the number of permutations of n objects r at a 
time; but, as permutations, a particular set of objects will be counted r! times. In the 
case of combinations , such a set will be counted only once, which reduces the number 
of possibilities by a factor of r!. 

In Example 2 of the previous section, it is precisely the number of combinations of five 
objects two at a time which is being calculated. That is, 

5! 5! 5.4.3.2.1 

= = = 10 

(5 — 2)!2! 3!2! 3.2. 1.2.1 


as before. 

Note: 

A traditional notation for the number of permutations of n objects r at a time is n P r . 
That is, 

n\ 

( n — r)! 

combinations of n objects r at a time is n C r . 
n\ 

( n — r)!r! 




A traditional notation for the number of 
That is, 


n/^i 

O y 
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EXAMPLES 


1. How many four digit numbers can be formed from the numbers 1,2, 3, 4, 5, 6, 7, 8, 9 if no 
digit can be repeated ? 

Solution 

This is the number of permutations of 9 objects four at a time; that is, 

9' 

' = 9. 8. 7.6. = 3,024. 

5! 

2. In how many ways can a team of nine people be selected from twelve ? 

Solution 


The required number is 


I2 n 
L-o — 


12 ! 

3!9! 


12 . 11.10 

3.2.1 


220 . 


3. In how many ways can we select a group of three men and two women from five men 
and four women ? 

Solution 

The number of ways of selecting three men from five men is 


5 


Os 


5! 

2!3! 


5.4. 

~2A 


10 . 


For each of these ways, the number of ways of selecting two women from four women is 


4 C 2 


4! _ 4.3 _ 
2!2! “ 2T “ 6 ' 


The total number of ways is therefore 10 x 6 = 60. 


4. What is the probability that one of four bridge players will obtain a thirteen 
card suit ? 

Solution 

The number of possible suits for each player is 


N 


52! 

~ 39!13! 


The probabilty that any one of the four players will obtain a thirteen card suit is thus 


4 x 


1 

N 


4.(39!)(13!) 

52! 


~ 6.29 x 10~ 10 . 
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5. A coin is tossed six times. Determine the probability of obtaining exactly four heads. 

Solution 

In a single throw of the coin, the probability of a head (and of a tail) is 

Secondly, the probability that a particular four out of six throws will be heads, and the 
other two tails, will be 


1111 [1 r 
2 2 2 2 1 . 2 ' 2 . 


1 _ 1 
¥ ~ 64' 


Finally, the number of choices of four throws from six throws is 


6 


C 4 


6 ! 

2!4! 


15. 


Hence, the required probability of exactly four heads is 


19.2.3 PERMUTATIONS OF SETS WITH SOME OBJECTS ALIKE 


INTRODUCTORY EXAMPLE 

Suppose twelve switch buttons are to be arranged in a row, and there are two red buttons, 
three yellow and seven green. How many possible distinct patterns can be formed ? 

Solution 


If all twelve buttons were of a different colour, there would be 12! possible arrangements. 

If we now colour two switches red, there will be only half the number of arrangements since 
every pair of positions previously held by them would have counted 2! times; that is, twice. 
If we then colour another three switches yellow, the positions previously occupied by them 
would have counted 3! times; that is, 6 times, so we reduce the number of arrangements 
further by a factor of 6. 

Similarly, by colouring another seven switches green, we reduce the number of arrangements 
further by a factor of 7!. 


Hence, the final number of arrangements will be 


12 ! 

2!3!7! 


7920. 


This example illustrates another standard rule that, if we have n objects of which rq are alike 

of one kind, r 2 are alike of another, r 3 are alike of another and rq. are alike of another, 

then the number of permutations of these n objects is given by 


rq!r 2 !r 3 !...r fe ! 
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19.2.4 EXERCISES 


1. Evaluate the following: 

(a) 6 P 3 ; (b) 6 C 3 ; (c) 7 P 4 ; (d) 7 C 4 . 

2. Verify that n C n - r = n C r in the following cases: 

(a) n — 7 and r = 2; (b) n = 10 and r = 3; (c) n — 5 and r = 2. 

3. How many four digit numbers can be formed from the digits 1,3, 5, 7, 8, 9 if none of the 
digits appears more than once in each number ? 

4. In how many ways can three identical jobs be filled by 12 different people ? 

5. From a bag containing 7 black balls and 5 white balls, how many sets of 5 balls, of 
which 3 are black and two are white, can be drawn ? 

6. In how many seating arrangements can 8 people be placed around a table if there are 

3 who insist on sitting together ? 

7. A commitee of 10 is to be selected from 6 lawyers 8 engineers and 5 doctors. If the com- 
mitte is to consist of 4 lawyers, 3 engineers and 3 doctors, how many such committees 
are possible ? 

8. An electrical engineer is faced with 8 brown wires and 9 blue wires. If he is to connect 

4 brown wires and 3 blue wires to 7 numbered terminals, in how many ways can this 
be done ? 

9. A railway coach has 12 seats facing backwards and 12 seats facing forwards, In how 
many ways can 10 passengers be seated if 2 refuse to face forwards and 4 refuse to face 
backwards ? 
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19.2.5 ANSWERS TO EXERCISES 


1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 
9. 


(a) 120; (b) 20; (c) 840; (d) 35. 

(a) 21; (b) 120; (c) 10. 

6 P 4 = 360. 

12 C 3 = 220. 

7 C 3 x 5 C 2 = 350. 

3! x 8 x 5! = 5760. 

6 Ci x 8 C 3 x 5 C 3 = 8400. 

8 C 4 x 9 C 3 x 7! = 29, 635, 200. 

12 P 2 x 12 P 4 x 18 P 4 = 115, 165,670,400. 
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UNIT 19.3 - PROBABILITY 3 - RANDOM VARIABLES 


19.3.1 DEFINING RANDOM VARIABLES 

(i) The kind of experiments discussed in the theory of probability are usually what are known 

as “random experiments”. 

For example, in an experiment which involves throwing a die, suppose that the die is “unbi- 
ased”. This means that it is just as likely to show one face as any other. 

Similarly, drawing 6 numbers out of a possible 45 for a lottery is a random experiment, 
provided it is just as likely for one number to be drawn as any other. 

In general, an experiment is a random experiment if there is more than one possible outcome 
(or event) and any one of those possible outcomes may occur. We assume that the outcomes 
are mutually exclusive (see Unit 19.1, section 19.1.4). 

The probabilities of the possible outcomes of a random experiment form a collection called 
the “probability distribution” of the experiment. These probabilities need not be the 
same as one another. 

The complete list of possible outcomes is called the “sample space” of the experiment. 

(ii) In a random experiment, each of the possible outcomes may, for convenience, be asso- 
ciated with a certain numerical value called a “random variable”. This variable, which 
we shall call x in general, makes it possible to refer to an outcome without having to use a 
complete description of it. 

In tossing a coin, for instance, we might associate a head with the number 1 and a tail with 
the number 0; then we could state the probabilities of either a head or a tail being obtained 
by means of the formulae 


P(x = 1) = 0.5 and P(x = 1) = 0.5 


Note: 

There is no restriction on the way we define the values of x ; it would have been just as correct 
to associate a head with —1 and a tail with 1. But it is customary to assign the values of 
random variables as logically as possible. For example, in discussing the probability that 
two 6’s would be obtained in 5 throws of a dice, we could sensibly use x = 1,2, 3, 4, 5 and 6, 
respectively, for the results that a 1,2, 3, 4, 5 and 6 would be thrown. 
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(iii) It is usually necessary to distinguish between random variables which are “discrete” 
and those which are “continuous” . 

Discrete random variables may take only certain specified values, while continuous random 
variables may take any value with in a certain specified range. 

Examples of discrete random variables include those associated with the tossing of coins, the 
throwing of dice and numbers of defective components in a batch from a production line. 

Examples of continuous variables include those associated with persons’ height or weight, 
lifetimes of manufactured components and rainfall figures. 

Note: 

For a random variable, x , the associated probabilities form a function, P(x ), called the 

“probability function” . 

19.3.2 PROBABILITY DISTRIBUTION AND 
PROBABILITY DENSITY FUNCTIONS 

(a) Probability Distribution Functions 

A “probability distribution function”, which will normally be denoted here by F(x ), is 

the relationship between a random variable, x, and the probability of obtaining any outcome 
for which the random variable can take values up to and including x. In other words, it is 
the probability, P(< x), that the random variable for the outcome is less than or equal to x. 

(i) Probability distribution functions for discrete variables 

By way of illustration, suppose that the number of ships arriving at a container terminal 
during any one day can be 0,1, 2, 3 or 4, with respective probabilites 0.1, 0.3, 0.35, 0.2 and 
0.05. The probabilites for outcomes other than those specified is taken to be zero. 
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The graph of the probability distribution function is as follows: 

F(x) x 

0.8 
0.6 

Prob. 0.4 

0.2 

0 12 3 4 

Arrivals 

The value of the probability distribution function at a value, x, of the random variable is 
the sum of the probabilities to the left of, and including, x. 

In view of the discontinuous nature of the graph, we have used “bullet” marks to indicate 
which end of each horizontal line belongs to the graph. 

(ii) Probability distribution functions for continuous variables 

For a continuous random variable, the probability distribution function is defined in a similar 
way as for a discrete variable. It measures (as before) the probability that the value of the 
random variable is less than or equal to x. 

By way of illustration, we shall quote, here, the example of an “exponential distribution” 
in which it may be shown that the lifetime of a certain electronic component (in thousands 
of hours) is represented by a probability distribution function 

F( x j = 1 1 — e“5 when x > 0; 

l 0 when x < 0. 
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The graph of the probability distribution function is as follows: 


F (x) 1 

0.8 

0.6 

0.4 

0.2 


0 2 4 6 8 


sc 


(b) Probability Density Functions 

In the case of continuous random variables, a second function, f(x) called the “probability 
density function” is defined to be the derivative, with respect to x, of the probability 
distribution function, F(x). 

That is, 


f( x ) = ^[ F (®)]- 

The probability density function measures the concentration of possible values of x. 
In the previous example, the probability density function is therefore given by 

{ \e~% when x > 0; 

0 when x < 0 
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The graph of the probability density function is as follows: 


f(x) x 

0.8 

0.6 

0.4 

0.2 


0 2 4 6 8 


sc 


We may observe that most components have short lifetimes, while a small number can survive 
for much longer. 

(c) Properties of probability distribution and probability density functions 

The following properties are a consequence of the appropriate definitions: 


(i) 

lim Fix) = 0 and lim Fix) = 1. 

x—>—oo x — >oo 


Proof: 

It is impossible for a random variable to have a value less than — oo and it is certain to have 
a value less than oo. 

(ii) If X\ < X 2 , then F(x i) < F(x 2 ). 

Proof: 

The outcomes of an experiment with random variable values up to and including X 2 includes 
those outcomes with random variable values up to and including X\ so that F(x 2 ) is as least 
as great as Fix 1 ). 

Note: 

Results (i) and (ii) imply that, for any value of x, the probability distribution function is 
either constant or increasing between 0 and 1. 
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(iii) The probability that an outcome will have a random variable value, x, within the range 
xi < x < X 2 is given by the expression 


F(x 2 ) - F(x 1 ). 


Proof: 

From, the outcomes of an experiment with random variable values up to and including X 2 , 
suppose we exclude those outcomes with random variable values up to and including x\. 
The residue will be those outcomes with random variable values which lie within the range 

X\ < X < X 2 - 


Thus, the difference between the values of the probability distribution function at two par- 
ticular points is the probability that the value of the random variable will either lie between 
those two points or will be equal to the higher of the two. 

Note: 

For a continuous random variable, this is also equal to the area under the graph of the 
probability density function between the two given points, by virtue of the definition that 
f(x) = i[F(4 

That is, 


rx 2 

/ f(x) dx. 

J X\ 


(iv) 


Proof: 


/ OO 

f(x) dx 

-OO 


1 . 


The total area under the probability density function must be 1 since the random variable 
must have a value somewhere. 

EXAMPLE 

For the distribution of component lifetimes (in thousands of hours) given earlier by 


f 1 — e 2 when x > 0; 
l 0 when x < 0, 
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determine the proportion of components which last longer than 3000 hours but less than or 
equal to 6000 hours. 

Solution 

The probability that components have lifetimes up to and including 3000 hours is given by 

F( 3) = l-e.-f. 


The probability that components have lifetimes up to and including 600 hours is given by 

F( 6) = 1 — e~% = 1 — e~ 3 . 


The probability that components last longer than 3000 hours but less than or equal to 6000 
hours is thus given by 


F( 6) - F( 3) = e“i - e" 3 ~ 0.173 


The required proportion is thus approximately one in six. 

19.3.3 EXERCISES 

1. A coin is tossed three times and the random variable, x, represents the number of heads 
minus the number of tails. 

Construct a definition for the probability distribution function, F(x ), 

(a) if the coin is ‘fair’ (perfectly balanced); 

(b) if the coin is biased so that a head is twice as likely to occur as a tail. 

2. Construct a definition for the probability distribution function, F(x), for the sum, x , 
of numbers obtained when a pair of dice is tossed. 

3. A certain assembly process is such that the probability of success at each attempt is 
0.2. The probability, P(x) that x independent attempts are needed to achieve success 
is given by 


P(x) = (0.2) (0.8)*- 1 x = 1, 2, 3,. . . 
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Plot a grap of the probability distribution function, F(x ), and determine the probability 
that success will be achieved by 

(a) less than four independent attempts; 

(b) more than three but less than or equal to five independent attempts. 

4. The probability density function of a random variable, x, is given by 

for 0 < x < 4; 
elswhere, 

where c is a constant. 

Determine 

(a) the value of c; 

(b) the probability distribution function, F(x); 

(c) the probability that x > 1. 

5. The shelf life (in hours) of a certain perishable packaged food is a random variable, x , 
with probability density function, f(x ) given by 

20000(a; + 100) -3 when x > 0; 

0 otherwise. 

Determine the probabilities that one of these packages will have a shelf life of 

(a) at least 200 hours; 

(b) at most 200 hours; 

(c) more than 80 hours but less than or equal to 120 hours. 




19.3.4 ANSWERS TO EXERCISES 


1. (a) 


F(x) 


| for — 3 < x < — 1; 
| for — 1 < x < 1; 

| for 1 < x < 3; 

1 for x > 3. 
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< 



(b) 


2 . 


3. (a) 0.488 (b) 0.3123 

4. (a) c= 

(b) 


(c) I 

5. (a) §; (b) |; (c) 0.102 


F for — 3 < x < —1; 
44 for — 1 < x < 1; 

F(x) = < 

4| for 1 < x < 3; 

,1 for x > 3. 

' 4g for 2 < x < 3; 
l 2 for 3 < x < 4; 

I for 4 < x < 5: 

D — 

4| for 5 < x < 6; 

44 for 6 < x < 7; 

F(x) = < h. for 7 < a; < 8. 

4| for 8 < x < 9; 

| for 9 < x < 10; 

4| for 10 < x < 11; 
f for 11 < x < 12; 

.1 for x > 12. 

' 0 when x < 0; 

F(x) = < \y/x when 0 < x < 4; 

1 when x > 4. 
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UNIT 19.4 - PROBABILITY 4 

MEASURES OF LOCATION AND DISPERSION 


19.4.1 COMMON TYPES OF MEASURE 

We include, here three common measures of location (or central tendency), and one common 
measure of dispersion (or scatter), used in the discussion of probability distributions. 

(a) The Mean 

(i) For Discrete Random Variables 

If the values x\, X 2 , £ 3 , . . . . , x n of a discrete random variable, x, have probabilities 
Pi, P 2 , P 3 , .... , P n , respectively, then Pi represents the expected frequency of Xi divided 
by the total number of possible outcomes. For example, if the probability of a certain value 
of x is 0.25, then there is a one in four chance of its occurring. 

The arithmetic mean, /i, of the distribution may therefore be given by the formula 


At = x i p i- 

i= 1 

(ii) For Continuous Random Variables 

In this case, it is necessary to use the probability density function, f(x), for the distribution 
which is the rate of increase of the probability distribution function, F(x). 

For a small interval, 5x of a> values, the probability that any of these values occurs is ap- 
proximately f(x)8x , which leads to the formula 





xf(x) dx. 


(b) The Median 

(i) For Discrete Random Variables 

The median provides an estimate of the middle value of x, taking into account the frequency 
at which each value occurs. More precisely, it is a value, m, of the random variable, x, for 
which 


P(x < m ) > 


1 

2 


and 


P(x > m) > 


1 

2 ' 
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The median for a discrete random variable may not be unique (see Example 1, on page 3). 

(ii) For Continuous Random Variables 

The median for a continuous random variable is a value of the random variable, x, for which 
there are equal chances of x being greater than or less than the median itself. More precisely, 
it may be defined as the value, m, for which P{x < m) = F(m) = 

Note: 

Other measures of location are sometimes used, such as “quartiles” , “deciles” and “per- 
centiles”, which divide the range of x values into four, ten and one hundred equal parts, 
respectively. For example, the third quartile of a distribution function, F(x), may be defined 
as a value, g 3 , of the random variable, x, such that 


(c) The Mode 

The mode is a measure of the most likely value occurring of the random variable, x. 

(i) For Discrete Random Variables 

In this case, the mode is any value of x with the highest probability, and, again, it may not 
be unique (see Example 1, on page 3). 

(ii) For Continuous Random Variables 

In this case, we require a value of x for which the probability density function (measuring 
the concentration of x values) has a maximum. 

(d) The Standard Deviation 

The most common meaure of dispersion (or scatter) for a probability distribution is the 

“standard deviation”, a. 

(i) For Discrete Random Variables 

In this case, the standard deviation is defined by the formula 


a = 


\ 


£ 

1=1 


[Xi 


^P(x) 
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(ii) For Continuous Random Variables 

In this case, the standard deviation is defined by the formula 


a = 



(. x — n) 2 f(x) dx, 


where f(x) denotes the probability density function. 

Each measures the dispersion of the x values around the mean, //. 

Note: 

cr 2 is known as the “variance” of the probability distribution. 

EXAMPLES 

1. Determine (a) the mean, (b) the median, (c) the mode and (d) the standard deviation 
for a simple toss of an unbiased die. 

Solution 

(a) The mean is given by 


u = i x - 


22 

~ 6 ~ 


= 3.5 


(b) Both 3 and 4 on the die fit the definition of a median, since 

P (x < 3) = ^ P(x > 3) = ^ 

and 

P(x < 4) = jj, P(x >4) = ^. 

(c) All six outcomes count as a mode since they all have a probability of |. 

(d) The standard deviation is given by 


a = 


\ 


X) -(f-3.5) 2 ^ 
At 6 


2.917 
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2. Determine (a) the mean, (b) the median, (c) the mode and (d) the standard deviation 
for the distribution function 


F(x) 


1 — e 2 when x > 0; 
0 when x < 0. 


Solution 

First, we need the probability density function, f(x), which is given by 


when x > 0; 
when x < 0 

Hence, 

(a) 


f(x) = 


1 -- 

-e 2 

2 c 


H — / -xe 2 dx, 
jo 2 


which, on integration by parts, gives 






+ 



e 2 da; 




2 . 


(b) The median is the value, m, for which 


F(m) 


1 

2 ' 


That is, 


giving 


l~e~ 


m 

2 


1 

2 ’ 



J 


and, hence, m ~ 1.386. 

(c) The mode is zero, since the maximum value of the probability density function 
occurs when x — 0. 
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(d) The standard deviation is given by 


<x 2 = 


poo 

/ -(x — 2 ) 2 e _ 2 dx, 

Jo 2 

which, on integration by parts, gives 

[(x — 2) 2 e _ ^| + [ 2(x — 2)e - ^ dx 


a 2 = 


Jo Jo 


= 4 


4(x — 2)e 2 


4e 2 dx = 4. 


Thus a — 2. 


19.4.2 EXERCISES 


1. A probability function, P(x), is defined by the following table: 


X 

0 

1 

2 

3 

4 

5 

6 

P(x) 

0.17 

0.29 

0.27 

0.16 

0.07 

0.03 

0.01 


Determine (a) the mean, (b) the median, (c) the mode and (d) the standard deviaton 
of this distribution. 

2. A certain assembly process is such that the probability of success at each attempt is 
0.2. The probability, P(x) that x independent attempts are needed to achieve success 
is given by 

P(x) = (0.2) (0.8)* -1 x = 1, 2, 3, . . . 

Determine (a) the mean, (b) the median and (c) the standard deviation of this distri- 
bution. 

3. The running distance (in thousands of kilometres) which car owners achieve from a 
certain type of tyre is a random variable with probability density function, /(x), where 


f(x) 


_L_ e x/30 w ]- len x > 0; 

0 when x < 0, 
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Determine 

(a) the probability that one of these tyres will last at most 19000km; 

(b) (i) the mean, (ii) the median and (iii) the standard deviation of the distribution. 

4. The probability density function, f(x), of a random variable, x, is defined by 


f(x) 


30a: 2 (l — x ) 2 when 0 < x < 1; 
0 otherwise. 


Determine the probability that x will take a value within two standard deviations of its 
mean. 


5. The probability density function, f(x), of a random variable, x, is given by 


! i xe x 2 / 12 when x > 0; 

0 otherwise. 

Determine (a) the median and (b) the mode for this distribution. Show also, (c), that 
the mean, /i, is given by 





e -x 2 /12 


&X. 


19.4.3 ANSWERS TO EXERCISES 


1. (a) Mean = 1.8; (b) median = 2; (c) mode = 1; (d) standard deviation = 1.34 

2. (a) Mean = 5; (b) median = 3; (c) standard deviation = 4.47 

3. (a) 0.47; (b) (i) mean = 30; (ii) median = 20.8; (iii) standard deviation = 30. 

4. 0.969 

5. (a) Median = \/l21n2; (b) mode = \/6. 
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UNIT 19.5 - PROBABILITY 5 - THE BINOMIAL DISTRIBUTION 


19.5.1 INTRODUCTION AND THEORY 

In this Unit, we shall be concerned, firstly, with probability problems having only two events 
(which are mutually exclusive and independent), although many trials may be possible. For 
example, the pairs of events could be “up and down”, “black and white”, “good and bad”, 
and, in general, “successful and unsuccessful”. 

Statement of the problem 

Suppose that the probability of success in a single trial is unaffected when successive trials 
are carried out (that is, we have independent events). Then what is the probability that, in 
n successive trials, exactly r will be successful ? 

General Analysis of the problem 

Let us build up the solution in simple stages: 

(a) If p is the probability of success in a single trial, then the probability of failure is 1 — p = q, 
say. 

(b) In the following table, let S stand for success and let F stand for failure. The table shows 
the possible results of one, two or three trials and their corresponding probabilities: 


NO. OF TRIALS 

POSSIBLE RESULTS 

RESPECTIVE PROBABILITIES 

1 

F,S 

q,p 

2 

FF,FS,SF,SS 

q 2 ,qp,pq,p 2 

3 

FFF,FFS,FSF,FSS, 

SFF,SFS,SSF,SSS 

q :i , q 2 p, q 2 p, qp 2 , 

9 9 o Q 

g p, qp , qp^p 


(c) Summary 

(i) In one trial, the probabilities that there will be exactly 0 or exactly 1 successes are the 
respective terms of the expression 

q + p. 

(ii) In two trials, the probabilities that there will be exactly 0, exactly 1 or exactly 2 successes 
are the respective terms of the expression 

q 2 + 2qp + p 2 ] that is, ( q + p ) 2 . 
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(iii) In three trials, the probabilities that there will be exactly 0, exactly 1, exactly 2 or 
exactly 3 successes are the respective terms of the expression 

q 3 + 3 q 2 p + 3 qp 2 + p 3 ; that is, (q + p) 3 . 

(iv) In any number, n, of trials, the probabilities that there will be exactly 0, exactly 1, 

exactly 2, exactly 3, or exactly n successes are the respective terms in the binomial 

expansion of the expression 

(« q + p) n ■ 


(d) MAIN RESULT: 

The probability that, in n trials, there will be exactly r successes, is the term containing p r 
in the binomial expansion of (q + p) n . 

It can be shown that this is the value of 


n C r q n ~ r p r . 


EXAMPLES 


1. Determine the probability that, in 6 tosses of a coin, there will be exactly 4 heads. 

Solution 

q = 0.5, p = 0.5, n = 6, r — 4. 

Hence, the required probability is given by 


6 U 4 .(0.5) 2 .(0.5) 4 


6 ! 1 1 
2!4!'4'16 


15 

64' 


2. Determine the probability of obtaining the most probable number of heads in 6 tosses 
of a coin. 

Solution 

The most probable number of heads is given by | x 6 = 3. 

The probability of obtaining exactly 3 heads is given by 


6 C 3 .(0.5) 3 (0.5) 3 


6 ! 1 1 
3!3!'8'8 


20 

64 


~ 0.31 
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3. Determine the probability of obtaining exactly 2 fives in 7 throws of a die. 

Solution 

Q = f, p=b n = 7, r = 2. 

Hence, the required probability is given by 


7 C 2 .(^ 


1\ 2 _ 7! /5\ 


- = 


6J 


5!2!A6 J ' \6 


- . - ~ 0.234 


4. Determine the probability of throwing at most 2 sixes in 6 throws of a die. 

Solution 

The phrase “at most 2 sixes” means exactly 0, or exactly 1, or exactly 2. Hence, we 
add together the first three terms in the expansion of (q + p) 6 , where q = | and p — 

It may be shown that 

(q + p ) 6 = q 6 + 6g 5 p + 15g 4 p 2 + 


and, by substituting for q and p, the sum of the first three terms turns out to be 


21875 

23328 


~ 0.938 


5. It is known that 10% of certain components manufactured are defective. If a random 
sample of 12 such components is taken, what is the probability that at least 9 are 
defective ? 

Solution 

Firstly, we note how the information suggests that removal of components for exami- 
nation does not affect the probability of 10%. This is reasonable, since our sample is 
almost certainly very small compared with all components in existence. 

Secondly, the probability of success in this example is 0.1, even though it refers to 
defective items, and hence the probability of failure is 0.9. 

Thirdly, using p = 0.1, q = 0.9, n = 12, we require the probabilities (added together) 
of exactly 9, 10, 11 or 12 defective items and these are the last four terms in the 
expansion of (q + p) n . 

That is, 

12 C 9 .(0.9) 3 .(0.1) 9 + ^Cio^O.g) 2 .^.!) 10 + 12 C' n .(0.9).(0.1) 11 + (0.1) 12 ~ 1.658 x 10“ 7 . 


Note: 

The use of the “binomial distribution” becomes very tedious when the number of trials 
is large and two other standard distributions (called the “normal distribution” and the 
“Poisson distribution” ) can sometimes be used. 
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19.5.2 EXERCISES 


1. A coin is tossed six times. What is the probability of getting exactly four heads ? 

2. What is the probability of throwing at least four 7’s in five throws of a pair of dice ? 

3. In a roll of five dice, what is the probability of getting exactly four faces alike ? 

4. If three dice are thrown, determine the probability that 

(a) all three will show the number 4; 

(b) all three will be alike; 

(c) two will show the number 4 and the third, something else; 

(d) all three will be different; 

(e) only two will be alike. 

5. Hospital records show that 10% of the cases of a certain disease are fatal. If five patients 
suffer from this disease, determine the probability that 

(a) all will recover; 

(b) at least three will die; 

(c) exactly three will die; 

(d) a particular three will die and the others survive. 

6. A particular student can solve, on average, half of the problems given to him. In order 
to pass the course, he is required to solve seven out of ten problems on an examination 
paper. What is the probability that he will pass ? 


19.5.3 ANSWERS TO EXERCISES 

1 . 


2 . 


3. 


15 x (0.5) 4 x 0.5 ~ 0.47 


1)% 6X (!)%(!„ 0.003, 


,' 1\ 4 ( 5 \ 

6 x 5 x ( - ) x - ) ~ 0.019 

. 6 / \ 6 , 
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4. (a) 

(b) 

(c) 

(d) 

(e) 

5. (a) 

(b) 

(c) 

(d) 


6 . 


1\ 3 

- ) ~ 0.0046; 
6 / 


3 x Q) 2x (^°.°69; 


6 x Q) ~ 0.028; 


1-6 




~ 0.56; 


6 x 6 x Q) ~ 0.168 


(0.9) e 


(0.1) 5 + 5 x (0.9) x (O.l) 4 + 10 x (0.9) 2 x (0.1) ; 


10 x (0.9) 2 x (O.l) 3 ~ 0.0081; 


10 x (0.9) 2 x (0.1) s 


~ 0.00081 


(1 + 10 + 45 + 120) x (0.5) 10 ~ 0.17 


0.0086 
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UNIT 19.6 - PROBABILITY 6 


STATISTICS FOR THE BINOMIAL DISTRIBUTION 
19.6.1 CONSTRUCTION OF HISTOGRAMS 

Elementary discussion on the presentation of data, in the form of frequency tables, histograms 
etc., usually involves experiments which are actually carried out. 

But we illustrate now how the binomial distribution may be used to estimate the results of 
a certain kind of experiment before it is performed. 

EXAMPLE 

For four coins, tossed 32 times, construct a histogram showing the expected number of 
occurrences of 0,1, 2, 3, 4 heads. 

Solution 

Firstly, in a single toss of the four coins, the probability of head (or tail) for each coin is |. 

The terms in the expansion of give the probabilities of exactly 0,1, 2, 3 and 4 heads, 

respectively. 

The expansion is 



showing that the probabilities of 0,1, 2, 3 and 4 heads in a single toss of four coins are 
,6, i and /g, respectively. 

Therefore, in 32 tosses of four coins, we may expect 0 heads, twice; 1 head, 8 times; 2 heads, 
12 times; 3 heads, 8 times and 4 heads, twice. 

The following histogram uses class- intervals for which each member is, in fact, situated at 
the mid-point: 
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No. of heads in 32 tosses of 4 coins 


Notes: 

(i) The only reason that the above histogram is symmetrical in shape is that the probability 
of success and failure are equal to each other, so that the terms of the binomial expansion 
are, themselves, symmetrical. 

(ii) Since the widths of the class- intervals in the above histogram are 1, the areas of the 
rectangles are equal to their heights. Thus, for example, the total area of the first three 
rectangles represents the expected number of times of obtaining at most 2 heads in 32 tosses 
of 4 coins. 

19.6.2 MEAN AND STANDARD DEVIATION OF A 
BINOMIAL DISTRIBUTION 

THEOREM 

If p is the probability of success of an event in a single trial and q is the probability of 

its failure, then the binomial distribution, giving the expected frequencies of 0, 1,2,3, n 

successes in n trials, has a mean of rip and a standard deviation of y/npq, irrespective of the 
number of times the experiment is to be carried out. 
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Proof: 


(a) The Mean 

From the binomial expansion formula, 

(? + p) n = q n + nq 7l ~ 1 p + % "~ 2 p 2 + ^ + . . . + nqp - 1 + p\ 

Hence, if the n trials are made N times, the average number of successes is equal to the 
following expression, multiplied by N, then divided by N: 

0 x q n + 1 x nq n ~ l p + 2 x ~ l \ n ~ 2 p 2 + 

3 x n ( n ~ ^ - 2 \ n -y + . .(n - 1) x nqp n ~ l + np n . 

That is, the mean is 

rip [q n ~ l + (n - 1 )q n ~ 2 p + — —q n ~ 3 p 2 + . . . + (n - 1 )qp n ~ 2 + p n 


= np(q + p) n 1 = rip since q + p = 1. 


(b) The Standard Deviation 

For the standard deviation, we observe that, if f r is the frequency of r successes when the n 
trials are conducted N times, then 


77 ' 

fr = N- ^q n - r p 

[n — ry.rl 


We use this, first, to establish a result for 


E r 2 fr. 


r=0 
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For example, 

0 2 /o = 0 .Nq n = 0 = O./o- 
l 2 /i = l.Nnq n ~ l p = l./i. 

2 2 / 2 = 

2 Nn(n — 1 )q n ~ 2 p 2 = Nn{n — 1 )q n ~ 2 p 2 + Nn{n — l)p 2 q n ~ 2 


= 2/ 2 + Nn(n - l)p 2 q n 2 ; 


3 2 /s = 


3iV 


n(n — l)(n — 2) „ , , , n(n — l)(n — 2) q . . 9/ . 

n V 3 p 3 = N — -g' 3 p 3 + Nn(n - l)p (n - 2)q n 3 p 


2 ! 


2 ! 


= 3/3 + Nn(n — 1 )p 2 (n — 2 )q n 3 p 


4 2 / 4 = 

AN 


n( n ~ 1 )(n - 2)(n - 3) _ Ar n(n - l)(n - 2) (w-3) gW _y | ^(n - 2)(n - 3) ^ n _ 4 ^ 2 


3! 


-q^p 1 = N- 


3! 


2 ! 


-q p 


= 4/4 + Nn(n — l)p 2 — 2)(n 3) ^" 4 - 2 - 


2 ! 


-q p ; 


and, in general, when r > 2, 
r 2 fr = 

n(n — 1 )(n — 2)....(n — r + 1) 


AT 


(r — 1)! 

This result, together with those for 0 2 ./o and l 2 /i, shows that 


f Nn(n— l)p 2 q n r p r = rf r +Nn(n—l)p 2 ^ ^ ’ " n r ~' r 2 


(n — r)!(r — 2)! 


q p 


E r2 /r = E r fr + - l)p 2 V 2 


r=0 r=0 


(n — r)!(r — 2)! 
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That is, 


y, r 2 f r = Nnp + Nn(n — l)p 2 (g + p) n 2 = IVrap + Nn(n — 1 )p 2 , 

r = 0 


since g + p — 1. 

It was also established, in Unit 18.3, that the standard deviation of a set, xi,x 2 , x 3 , . . . x m , 
of m observations, with a mean value of x, is given by the formula 


a 


2 




which, in the present case, may be written 


a 


2 




r= 0 


1 

N* 



2 


Hence, 

o- 2 = ^ ( Nnp + Nn(n - l)p 2 ) - -j^(Nnp) 2 , 

which gives 


0 0 0 0 0 0 / \ 
a — np + n p — rip — n p — np( 1 — p) — npq ; 


and so, 


a = y/npq. 
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ILLUSTRATION 


For direct calculation of the mean and the standard deviation for the data in the previous 
coin-tossing problem, we may use the following table, in which Xi denotes numbers of heads 
and fi denotes the corresponding expected frequencies: 


Xi 

fi 

fiXi 

fzx'i 

0 

2 

0 

0 

1 

8 

8 

8 

2 

12 

24 

48 

3 

8 

24 

72 

4 

2 

8 

32 

Totals 

32 

64 

160 


The mean is given by 


x = 


64 

32 


2 (obviously) , 


which agrees with np = 4 x 
The standard deviation is given by 


<7 = 


/ 



1 , 


which agrees with y ' npq = ^4 x | x 

Note: 

If the experiment were carried out N times instead of 32 times, all values in the last three 
columns of the above table would be multiplied by a factor of ||, which would then cancel 
out in the remaining calculations. 

EXAMPLE 

Three dice are rolled 216 times. Construct a binomial distribution and show the frequencies 
of occurrence for 0,1,2 and 3 sixes. 

Evaluate the mean and the standard deviation of the distribution. 
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Solution 


First of all, the probability of success in obtaining a six with a single throw of a die is and 
the corresponding probability of failure is |. 

For a single throw of three dice, we require the expansion 


1 

6 


+ 






3 


showing that the probabilities of 0,1,2 and 3 sixes are |||, 2 \ 5 6 and 2 J 6 , respectively. 

Hence, in 216 throws of the three dice we may expect 0 sixes, 125 times; 1 six, 75 times; 2 
sixes, 15 times and 3 sixes, once. The corresponding histogram is as follows: 


Freq. 



No. of sixes in 216 throws of 3 dice 
From the previous Theorem, the mean value is 


3 x 


1 _ 1 
6 “ 2 


and the standard deviation is 



a/15 

~ 6 ~' 
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19.6.3 EXERCISES 


1. Four dice are rolled 81 times. If less than 5 on a die is considered to be a success, and 
everything else a failure, 

(a) draw the corresponding histogram for the expected frequencies of success; 

(b) determine the expected number of times of obtaining at least three successes among 
the four dice; 

(c) shade the area of the histogram which is a measure of the result in (c); 

(d) calculate the mean and the standard deviation of the frequency distribution in (a). 

2. In a seed- viability test, 450 seeds were placed on a filter-paper in 90 rows of 5. The 
number of seeds that germinated in each row were counted, and the results were as 
follows: 


No. of seeds germinating per row 

0 

1 

2 

3 

4 

5 

Observed Frequency of rows 

0 

1 

11 

30 

38 

10 


If the germinating seeds were distributed, at random, among the rows, we would expect 
a binomial distribution with an index of n = 5. 

Determine 

(a) the average number of seeds germinating per row; 

(b) the probability of a single seed germinating; 

(c) the expected frequencies of rows for each number of seeds germinating; 

Draw the histogram for the expected frequencies and the histogram for the observed 
frequencies. 




19.6.4 ANSWERS TO EXERCISES 


1. (a) The histogram is as follows: 


Freq. 


0 12 3 4 

No. of “less than 5’s” in 81 throws of 4 dice 

(b) Expected frequency of 3 or 4 successes = 32 + 16 = 48; 

(c) Shade the last two rectangles on the right of the histogram; 

(d) Mean = 4 x | = | and Standard Deviation = ^4 x | x | 

2. (a) Average number of seeds germinating per row is ^ = 3.50; 

(b) Probability of a single seed germinating is ^ 

(c) Expected frequencies for 0,1, 2, 3, 4, 5 seeds are 0.2187, 2.5515, 11.9070, 27.7830, 
32.4135, 15.1263 respectively. 

(d) The histograms are as follows: 


Estimated 
no. 
of 

rows 

0 12 3 4 

No. of seeds germinating 
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UNIT 19.7 - PROBABILITY 7 


THE POISSON DISTRIBUTION 
19.7.1 THE THEORY 

We recall that, in a binomial distribution of n trials, the probability, P r , that an event occurs 
exactly r times out of a possible n is given by 


Pr 


ni 


(n — r)\r\ 


P r q n ~ r , 


where p is the probability of success in a single trial and q = 1 — p is the probability of 
failure. 

Now suppose that n is very large compared with r and that p is very small compared with 
1. 

Then, 

(a) 

Til 

= n(n — l)(n — 2) (n — r + 1) ~ n r . 

( n — r)\ 

ILLUSTRATION 

If n = 120 and r = 3, then 

n \ 120 ! 

7 : — = ' = 120 x 119 x 118 ~ 120 3 . 

(n-r)! 117! 


(b) 


q r = (1 -p) r ~ 1, 


so that 


q n-r ^ q n = (! _ p y 


1 



We may deduce that 


P 

1 r — 


n r p r ( 1 - p) r 


T\ 


(np) r ( 1 — p) n 


or 


P rv 
1 r 


(■ np) r 


T\ 


1 — np + 


n(n 


2 ! 


1 ) 2 
— P 


n(n 


l)(n 


3 ! 


2 ) 3 

— p 3 + 





1 — np + 


(np) 2 

2 ! 


M 3 

3 ! 


Hence, 

p r „ 

V \ 

The number, np, in this formula is of special significance, being the average number of 
successes to be expected a single set of n trials. 

If we denote np by /i, we obtain the “Poisson distribution” formula, 


P rv 
^ r — 


p r e-» 


Notes: 

(i) Although the formula has been derived from the binomial distribution, as an approx- 
imation, it may also be used in its own right, in which case we drop the approximation 
sign. 

(ii) The Poisson distribution is more use than the binomial distribution when n is a very large 
number, the binomial distribution requiring the tedious evaluation of its various coefficients. 

(iii) The Poisson distribution is of particular use when the average frequency of occurrence 
of an event is known, but not the number of trials. 
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EXAMPLES 


1. The number of cars passing over a toll-bridge during the time interval from 10a.m. until 
11a.m. is 1,200. 

(a) Determine the probability that not more than 4 cars will pass during the time 
interval from 10.45a.m. until 10.46a.m. 

(b) Determine the probability that 5 or more cars pass during the same interval. 

Solution 

The number of cars which pass in 60 minutes is 1200, so that the average number of 
cars passing, per minute, is 20 = /i. 

(a) The probability that not more than 4 cars pass in a one-minute interval is the sum 
of the probabilities for 0, 1,2,3 and 4 cars. 

That is, 

4 (20) r e -20 _ 

2 -*/ D — 


( 20 )° ( 20) 1 ( 20) 2 ( 20) 3 ( 20) 4 1 20 _ 

0! 1! 2! + 3! 4! J 

8221e“ 20 ~ 1.69 x 10" 5 . 

(b) The probability that 5 or more cars will pass in a one-minute interval is the proba- 
bility of failure in (a). In other words, 

1 _ A (20) r e~ 2 ° _ 


1 - 8221e“ 20 ~ 0.99998 

2. A company finds that, on average, there is a claim for damages which it must pay 7 

times in every 10 years. It has expensive insurance to cover this situation. 

The premium has just been increased, and the firm is considering letting the insurance 

lapse for 12 months as it can afford to meet a single claim. 
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Assuming a Poisson distribution, what is the probability that there will be at least two 
claims during the year ? 

Solution 

Using /jl = A = 0.7, the probability that there will be at most one claim during the 
year is given by 


P 0 + P 1 = e~ 0 - 7 + e _0,7 0.7 = e°' 7 (l + 0.7). 


The probability that there will be at least two claims during the year is given by 


1 - e _0 ' 7 (l + 0.7) ~ 0.1558 


3. There is a probability of 0.005 that a welding machine will produce a faulty joint when 
it is operated. The machine is used to weld 1000 rivets. Determine the probability that 
at least three of these are faulty. 

Solution 

First, we have fi = 0.005 x 1000 = 5. 

Hence, the probability that at most two will be faulty is given by 


Po + Pi + P 2 = e~ 5 


'5° 5 1 5 2 " 

Of + 1 \ + 2!" 


That is, 


e -5 [l + 5 + 12.5] ~ 0.125 

Hence, the probability that at least three will be faulty is approximately 

1 - 0.125 = 0.875 
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19.7.2 EXERCISES 


1. The probability that a glass fibre will shatter during an experiment is believed to follow 
a Poisson distribution. 

In a certain apparatus, it was found that, on average, 7 glass fibres shattered. 
Determine the probability that, during a single demonstration of the experiment, 

(a) two glass fibres will shatter; 

(b) at least one glass fibre will shatter. 

2. A major airline operates 350 flights per day throughout the world. The probability that 
a flight will be delayed for more than one hour, for any reason, is 0.7%. If more than 
four flights suffer such delays in one day, the implications for route organisation and 
crewing become serious. Calling such a day a “bad day”, determine the probabilities 
that 

(a) any particular day is a bad day; 

(b) at most two bad days occur in one week; 

(c) more than 50 bad days occur in a year of 365 days. 

State your answers correct to three significant figures. 

3. It is known that 3% of bolts made by a certain machine are defective. If the bolts 
are packaged in boxes of 50, determine the probability that a given box will contain 4 
defectives. 

4. If 0.04% of cars break down while driving through a certain tunnel, determine the 
probability that at most 2 break down out of 2000 cars entering the tunnel on a given 
day. State your answer correct to three significant figures. 

19.7.3 ANSWERS TO EXERCISES 

1. (a) 2.28 x 10 -3 ; (b) 0.0676 

2. (a) 0.102; (b) 0.128; (c) 0.011 

3. 0.047 using yU = 1.5 

4. 0.952 using yU = 0.8 
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UNIT 19.8 - PROBABILITY 8 


THE NORMAL DISTRIBUTION 

19.8.1 LIMITING POSITION OF A FREQUENCY POLYGON 

The distribution considered here is also appropriate to examples where the number of tri- 
als is large and, hence, the calculation of frequencies and probabilities, using the binomial 
distribution, would be inconvenient. 

We shall introduce the “normal distribution” by considering the histograms of the bino- 
mial distribution for a toss of 32 coins as the number of coins increases. 

The probability of obtaining a head is \ and the probability of obtaining a tail is also 

(i) One Coin 


32 



+ 




16 + 16. 


Freq. 


i 

16 








0 

1 


No. of heads in 32 tosses of 1 coin 


1 







(iv) Four Coins 



2 + 8 + 12 + 8 + 2 . 


Freq. 


0 12 3 4 

No. of heads in 32 tosses of 4 coins 

It is apparent that, as the number of coins increases, the frequency polygon (consisting of the 
straight lines joining the midpoints of the tops of each rectangle) approaches a symmetrical 
bell-shaped curve. 

This, of course, is true only when the histogram itself is either symmetrical or nearly sym- 
metrical. 

DEFINITION 

As the number of trials increases indefinitely, the limiting position of the frequency polygon 
is called the “normal frequency curve” . 

THEOREM 

In a binomial distribution for N samples of n trials each, where the probability of success in 
a single trial is p, it may be shown that, as n increases indefinitely, the frequency polygon 
approaches a smooth curve, called the “normal curve” , whose equation is 



N (x-x ) 2 
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In this equation 

x is the mean of the binomial distribution = np\ 


a is the standard deviation of the binomial distribution = \Jnp{ 1 — p); 
y is the frequency of occurence of the value, x. 

For example, the histogram for 32 tosses of 4 coins approximates to the following normal 
curve: 


% 


12 

8 


2 


0 12 3 4 


sc 


Notes: 

(i) We omit the proof of the Theorem since it is beyond the scope of the present text. 

(ii) The larger the value of n, the better is the level of approximation. 

(iii) The normal curve is symmetrical about the straight line x = x, since the value of y is 
the same at x = x ± h for any number, h. 

(iv) If the relative frequency (or probability) with which the value, x, occurs is denoted by 
P, then P = y/N and the above relationship can be written 


P 



(x — x)^ 

2<t2 


5 


the graph of which is called the “normal probability curve” . 
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(v) Symmetrical curves are easier to deal with if the vertical axes of co-ordinates is the line 
of symmetry. 

The normal probability curve can be simplified if we move the origin to the point (x, 0) and 
plot Pa on the vertical axis instead of P. 

Letting Pa = Y and = X the equation of the normal probability curve becomes 


Y = 



5 


which represents the “standard normal probability curve” . 

From any point on it, we may obtain the values of the original P and x values by using the 
formulae 


x = aX + x and P 


Y 

a 


Y 


§ 


X 


(vi) If the probability of success, p, in a single trial is not equal to, or approximately equal 
to, 7 ,, then the distribution given by the normal frequency curve and the two subsequent 
curves will be a poor approximation and is seldom used for such cases. 

19.8.2 AREA UNDER THE NORMAL CURVE 

For the histogram of a binomial distribution, corresponding to values of x, suppose that 
x = a and x = b are the values of x at the base-centres of two particular rectangles, where 
b > a and all rectangles have width 1 . 
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Then, the area of the histogram from x — a — \ to x — b+\ represents the number of times 
which we can expect values of x, between x = a and x = b inclusive, to occur. 

Consequently, for a large number of trials, we may use the area under the normal curve 
between x = a — \ and x = b + 

In a similar way, the probability that x will lie between x = a and x — b is represented by 
the area under the normal probability curve from x = a — \ and x — b + We note that 
the total area under this curve must be 1, since it represents the probability that any value 
of x will occur (a certainty). 

In order to make use of a standard normal probabilty curve for the same purpose, the 
conversion formulae from x to X and P to Y must be used. 

Note: 

Tables are commercially available for the area under a standard normal probability curve; 
and, in using such tables, the above conversions will usually be necessary. A sample table is 
given in an appendix at the end of this unit. 

EXAMPLE 

If 12 dice are thrown, determine the probability, using the normal probability curve approx- 
imation, that 7 or more dice will show a 5. 

Solution 

For this example, we use P — q — n — 12 and we need the area under the normal 
probability curve from x = 6.5 to x — 12.5. 

The mean of the binomial distribution, in this case, is x = 12 x | = 2. 

The standard deviation is a = y 2 x | x | ~ \/l.67 ~ 1.29. 

The required area under the standard normal probability curve will be that lying between 


X = — — - ~ 3.49 and X = — — - ~ 8.14 
1.29 1.29 

In practice, we take the whole area to the right of X = 3.49 since the area beyond X = 8.14 
is negligible. 

Also, the total area to the right of X = 0 is 0.5; and, hence, the required area is 0.5 minus 
the area from X = 0 to X = 3.49. 
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From tables, the required area is 0.5 — 0.4998 = 0.0002 and this is the probability that, when 
12 dice are thrown, 7 or more will show a 5. 

Note: 

If we had required the probability that 7 or fewer dice show a 5, we would have needed the 
area under the normal probability curve from x = —0.5 to x — 7.5. This is equivalent to 
taking the whole of the area under the standard normal probability curve which lies to the 
left of 


X 


7.5-2 

1.29 


~ 4.26 


19.8.3 NORMAL DISTRIBUTION FOR CONTINUOUS VARIABLES 

So far, the variable, x, considered in a Normal distribution has been “discrete”; that is, x 
has been able to take only the specific values 0,1, 2, 3 etc. 

Here, we consider the situation when x is a “continuous” variable; that is, it may take any 
value within a certain range appropriate to the problem under consideration. 

For a large number of observations of a continuous variable, the corresponding histogram 
need not have rectangles of class-width 1 but of some other number, say c. 

In this case, it may be shown that the normal curve approximation to the histogram has 
equation 


Nc {x-x) 2 

y = —T7r e 2ff2 • 

cr^ZTT 

The smaller is the value of c, the larger is the number of rectangles and the better is the 
approximation supplied by the curve. 

If we wished to calculate the number of x- values lying between x = a and x = b (where 
b > a), we would need to calculate the area of the histogram from x = a to x = b inclusive, 
then divide by c, since the base-width is no longer 1. 

We conclude that the number of these x-values approximates to the area under the normal 
curve from x = a to x = b; and similarly, the area under the normal probability curve, from 
x = a to x = b gives an estimate for the probability that values of x between x = a and 
x = b will occur. 
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EXAMPLE 


Given a normal distribution of a continuous variable, x, with N = 2000, x = 20 and a — 5, 
determine 

(a) the number of x-values lying between 12 and 22; 

(b) the number of x-values larger than 30. 

Solution 

(a) The area under the normal probability curve between x — 12 and x = 22 is the area 
under the standard normal probability curve from 


X 


12 - 20 
5 


—1.6 to X 


22 - 20 
5 


0.4 


and, from tables, this is 0.4452 + 0.1554 = 0.6006 

Hence, the required number of values is approximately 0.6006 x 2000 ~ 1201. 

(b) The total area under the normal probability curve to the right of x = 30 is the area 
under the standard normal probability curve to the right of 


30 - 20 

YY — 

5 


2 


and, from tables, this is 0.0227 

Hence, the required number of values is approximately 0.0227 x 2000 ~ 45. 

19.8.4 EXERCISES 

1. Use a normal probability curve approximation to determine the probability that, in a 
toss of 9 coins, 3 to 6 heads are shown. 

2. A coin is tossed 100 times. Use a normal probability curve approximation to determine 
the probability of obtaining 

(a) exactly 50 heads; 

(b) 60 or more heads. 



3. Assume that one half of the people in a certain community are regular viewers of 
television. Of 100 investigators, each interviewing 10 individuals, how many would you 
expect to report that 3 people or fewer were regular viewers ? (Use a normal probability 
curve approximation). 

4. A manufacturer knows that, on average, 2% of his products are defective. Using a 
normal probability curve approximation, determine the probability that a batch of 100 
components will contain exactly 5 defectives. 

5. If the average life of a certain make of storage battery is 30 months, with a standard 
deviation of 6 months, what percentage can be expected to last from 24 to 36 months, 
assuming that their lifetimes follow a normal distribution. 

6. If the heights of 10,000 university students closely follow a normal distribution, with a 
mean of 69.0 inches and a standard deviation of 2.5 inches, how many of these students 
would you expect to be at least 6 feet in height ? 

7. In a certain trade, the average wage is £7.20 per hour and the standard deviation is 
90p. If the wages are assumed to follow a normal distribution, what percentage of the 
workers receive wages between £6.00 and £7.00 per hour ? 

19.8.5 ANSWERS TO EXERCISES 

1. P ~ 0.8614 

2. (a) 0.0796 
(b) 0.0287 

3. 17. 

4. 0.0305 

5. 68.26 

6. 1151. 

7. 32.11 
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APPENDIX 

AREA UNDER THE STANDARD NORMAL PROBABILITY CURVE 

The area given is that from X = 0 to a given value, X\ 


Xi 

0.00 

0.01 

0.02 

0.03 

0.04 

0.05 

0.06 

0.07 

0.08 

0.09 

0.0 

0.0000 

0.0040 

0.0080 

0.0120 

0.0160 

0.0199 

0.0239 

0.0279 

0.0319 

0.0359 

0.1 

0.0398 

0.0438 

0.0478 

0.0517 

0.0557 

0.0596 

0.0636 

0.0675 

0.0714 

0.0753 

0.2 

0.0793 

0.0832 

0.0871 

0.0910 

0.0948 

0.0987 

0.1026 

0.1064 

0.1103 

0.1141 

0.3 

0.1179 

0.1217 

0.1255 

0.1293 

0.1331 

0.1368 

0.1046 

0.1443 

0.1480 

0.1517 

0.4 

0.1554 

0.1591 

0.1628 

0.1664 

0.1700 

0.1736 

0.1772 

0.1808 

0.1844 

0.1879 

0.5 

0.1915 

0.1950 

0.1985 

0.2019 

0.2054 

0.2088 

0.2123 

0.2157 

0.2190 

0.2224 

0.6 

0.2257 

0.2291 

0.2324 

0.2357 

0.2389 

0.2422 

0.2454 

0.2486 

0.2517 

0.2549 

0.7 

0.2580 

0.2611 

0.2642 

0.2673 

0.2704 

0.2734 

0.2764 

0.2794 

0.2823 

0.2852 

0.8 

0.2881 

0.2910 

0.2939 

0.2967 

0.2995 

0.3023 

0.3051 

0.3078 

0.3106 

0.3133 

0.9 

0.3159 

0.3186 

0.3212 

0.3238 

0.3264 

0.3289 

0.3315 

0.3340 

0.3365 

0.3389 

1.0 

0.3413 

0.3438 

0.3461 

0.3485 

0.3508 

0.3531 

0.3554 

0.3577 

0.3599 

0.3621 

1.1 

0.3643 

0.3665 

0.3686 

0.3708 

0.3729 

0.3749 

0.3770 

0.3790 

0.3810 

0.3830 

1.2 

0.3849 

0.3869 

0.3888 

0.3907 

0.3925 

0.3944 

0.3962 

0.3980 

0.3997 

0.4015 

1.3 

0.4032 

0.4049 

0.4066 

0.4082 

0.4099 

0.4115 

0.4131 

0.4147 

0.4162 

0.4177 

1.4 

0.4192 

0.4207 

0.4222 

0.4236 

0.4251 

0.4265 

0.4279 

0.4292 

0.4306 

0.4319 

1.5 

0.4332 

0.4345 

0.4357 

0.4370 

0.4382 

0.4394 

0.4406 

0.4418 

0.4429 

0.4441 

1.6 

0.4452 

0.4463 

0.4474 

0.4484 

0.4495 

0.4505 

0.4515 

0.4525 

0.4535 

0.4545 

1.7 

0.4554 

0.4564 

0.4573 

0.4582 

0.4591 

0.4599 

0.4608 

0.4616 

0.4625 

0.4633 

1.8 

0.4641 

0.4649 

0.4656 

0.4664 

0.4671 

0.4678 

0.4686 

0.4693 

0.4699 

0.4706 

1.9 

0.4713 

0.4719 

0.4726 

0.4732 

0.4738 

0.4744 

0.4750 

0.4756 

0.4761 

0.4767 

2.0 

0.4773 

0.4778 

0.4783 

0.4788 

0.4793 

0.4798 

0.4803 

0.4808 

0.4812 

0.4817 

2.1 

0.4821 

0.4826 

0.4830 

0.4834 

0.4838 

0.4842 

0.4846 

0.4850 

0.4854 

0.4857 

2.2 

0.4861 

0.4864 

0.4868 

0.4871 

0.4875 

0.4878 

0.4881 

0.4884 

0.4887 

0.4890 

2.3 

0.4893 

0.4896 

0.4898 

0.4901 

0.4904 

0.4906 

0.4909 

0.4911 

0.4913 

0.4916 

2.4 

0.4918 

0.4920 

0.4922 

0.4925 

0.4927 

0.4929 

0.4931 

0.4932 

0.4934 

0.4936 

2.5 

0.4938 

0.4940 

0.4941 

0.4943 

0.4945 

0.4946 

0.4948 

0.4949 

0.4951 

0.4952 

2.6 

0.4953 

0.4955 

0.4956 

0.4957 

0.4959 

0.4960 

0.4961 

0.4962 

0.4963 

0.4964 

2.7 

0.4965 

0.4966 

0.4967 

0.4968 

0.4969 

0.4970 

0.4971 

0.4972 

0.4973 

0.4974 

2.8 

0.4974 

0.4975 

0.4976 

0.4977 

0.4977 

0.4978 

0.4979 

0.4979 

0.4980 

0.4981 

2.9 

0.4981 

0.4982 

0.4983 

0.4983 

0.4984 

0.4984 

0.4985 

0.4985 

0.4986 

0.4986 

3.0 

0.4987 

0.4987 

0.4987 

0.4988 

0.4988 

0.4989 

0.4989 

0.4989 

0.4989 

0.4990 

3.1 

0.4990 

0.4991 

0.4991 

0.4991 

0.4992 

0.4992 

0.4992 

0.4992 

0.4993 

0.4993 

3.2 

0.4993 

0.4993 

0.4994 

0.4994 

0.4994 

0.4994 

0.4994 

0.4995 

0.4995 

0.4995 

3.3 

0.4995 

0.4995 

0.4996 

0.4996 

0.4996 

0.4996 

0.4995 

0.4996 

0.4996 

0.4997 

3.4 

0.4997 

0.4997 

0.4997 

0.4997 

0.4997 

0.4997 

0.4997 

0.4997 

0.4998 

0.4988 

3.5 

0.4998 

0.4998 

0.4998 

0.4998 

0.4998 

0.4998 

0.4998 

0.4998 

0.4998 

0.4998 

3.6 

0.4998 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

3.7 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

3.8 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.4999 

0.5000 

0.5000 

0.5000 
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